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^'£S. UANT,TATIVE ANALYSIS OF PROTEINS 
OR PROTEIN FUNCTION IN COMPLEX MIXTURES 

This invention was made through funding from the National Science 
Foundation Science and Technology Center for Molecular Biotechnology (grants 
5T32HG and BIR9214821) and the National Institutes of Health (NIH grants 
RR11823, T32HG00035, HD-02274 and GM60184). The United States government 
has certain rights in this invention. 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application takes priority under 35 U.8.C.§1 1 9(e) from U.S. provisional 
apphcations serial no. 60/097,788, filed August 25, 1998 and serial no.60/099 113 
filed September 3, 1998, both of which are incorporated in their entirety by reference 
herein. 



BACKGROUND OF THE INVENTION 
Genomic technology has advanced to a point at which, in principle, it has 
become possible to determine complete genomic sequences and to quantitatively 
measure the mRNA levels for each gene expressed in a cell. For some species the ' 
complete genomic sequence has now been determined, and for one strain of the 
yeast Saccftaremyces cerWa/ae, the mRNA levels for each expressed gene have 
been precisely quantified under different growth conditions (Velculescu et al ig 97 ) 
Comparative CDMA array analysis and related technologies have been used to 
determine induced changes in gene expression at the mRNA level by concurrently 
monitoring the expression level of a large number of genes (in some cases all the 
genes) expressed by the investigated cell or tissue (Shalonetal 1996) 
Furthermore, biological and computational techniques have been used to correlate 
specflc function with gene sequences. The interpretation of the data obtained by 
» se techniques in the context of the structure, control and mechanism of biological 
systems has been recognized as a considerable challeng . In particular, it has been 
xtremely difficult to explain the mechanism of biological processes by g nomic 
analysis alone. 
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Proteins are essential for the control and execution of virtually every biological 
process. The rate of synthesis and the half-life of proteins and thus their expression level 
are also controlled post-transcriptionally. Furthermore, the activity of proteins is frequently 
modulated by post-translational modifications, in particular protein phosphorylation, and 
dependent on the association of the protein with other molecules including DNA and 
proteins. Neither the level of expression nor the state of activity of proteins is therefore 
directly apparent from the gene sequence or even the expression level of the 
corresponding mRNA transcript. It is therefore essential that a complete description of a 
biological system include measurements that indicate the identity, quantity and the state of 
activity of the proteins which constitute the system. The large-scale (ultimately global) 
analysis of proteins expressed in a cell or tissue has been termed proteome analysis 
(Pennington et al., 1997). 

At present no protein analytical technology approaches the throughput and level of 
automation of genomic technology. The most common implementation of proteome 
analysis is based on the separation of complex protein samples most commonly by two- 
dimensional gel electrophoresis (2DE) and the subsequent sequential identification of the 
separated protein species (Ducret et al., 1998; Garrels et al., 1997; Link et al., 1997; 
Shevchenko et al., 1996; Gygi et al. 1999; Boucherie et al., 1996). This approach has 
been revolutionized by the development of powerful mass spectrometry techniques and 
the development of computer algorithms which correlate protein and peptide mass spectral 
data with sequence databases and thus rapidly and conclusively identify proteins (Eng et 
al., 1994; Mann and Wilm, 1994; Yates et al., 1995). This technology has reached a level of 
sensitivity which now permits the identification of essentially any protein which is detectable 
by conventional protein staining methods including silver staining (Figeys and Aebersold, 
1998; Figeys et al., 1996; Figeys et al., 1997; Shevchenko et al., 1996). However, the 
sequential manner in which samples are processed limits the sample throughput, the most 
sensitive methods have been difficult to automate and low abundance proteins, such as 
regulatory proteins, escape detection without prior enrichment, thus effectively limiting the 
dynamic range of the technique. In the 2DE/(MS)N method, proteins are quantified by 
densitometry of stained spots in the 2DE gels. 
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The development of methods and instrumentation for automated, data-dependent 
electrospray ionization (ESI) tandem mass spectrometry (MS") in conjunction with 
microcapillary liquid chromatography (pLC) and database searching has significantly 
increased the sensitivity and speed of the identification of gel-separated proteins As an 
alternafive to the 2DE/MS" approach to proteome analysis, the direct analysis by tandem 
mass spectrometry of peptide mixtures generated by the digestion of complex protein 
mixtures has been proposed (Dongre et al„ 1997). pLC-Ms/MS has also been used 
successfully for the large-scale identification of individual proteins direcUy from mixtures 
without gel electrophoretic separation (Link et al„ 1999; OpKek et al.. 1997) While these 
approaches dramatically accelerate protein identification, the quantities of the analyzed 
proteins cannot be easily determined, and these methods have not been shown to 
substantially alleviate the dynamic range problem also encountered by the 2DBMS/MS 
approach. Therefore, low abundance proteins in complex samples are also difficult to 
analyze by the uLC/MS/MS method without their prior enrichment. 

It is therefore apparent that current technologies, while suitable to identify the 
components of protein mixtures, are neither capable of measuring the quantity nor the state 
of activity of the protein in a mixture. Even evolutionary improvements of the current 
approaches are unlikely to advance their performance sufficiently to make routine 
quantitative and functional proteome analysis a reality. 

This invention provides methods and reagents that can be employed in preteome 
analysis which overcome the limitations inherent in traditional techniques. The basic 
approach described can be employed for the quantitative analysis of protein expression in 
complex samples (such as cells, tissues, and fractions thereof), the detection and 
quantitation of specific proteins in complex samples, and the quantitative measurement of 
specific enzymatic activities in complex samples. 

in this regard, a multitude of analytical techniques are presently available for clinical 
and d,agnostic assays which detect the presence, absence, deficiency or excess of a 
protein or protein function associable with a normal or disease state. While these 
techniques are quite sensitive, they do not necessarily provide chemical speciation of 
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products and may, as a result, be difficult to use for assaying several proteins or enzymes 
simultaneously in a single sample. Current methods may not distinguish among aberrant 
expression of different enzymes or their malfunctions which lead to a common set of clinical 
symptoms. The methods and reagents herein can be employed in clinical and diagnostic 
assays for simultaneous (multiplex) monitoring of multiple proteins and protein reactions. 

SUMMARY OF THE INVENTION 
This invention provides analytical reagents and mass spectrometry-based methods 
using these reagents for the rapid, and quantitative analysis of proteins or protein function 
in mixtures of proteins. The analytical method can be used for qualitative and particularly 
for quantitative analysis of global protein expression profiles in cells and tissues, i.e. the 
quantitative analysis of proteomes. The method can also be employed to screen for and 
identify proteins whose expression level in cells, tissue or biological fluids is affected by a 
stimulus (e.g., administration of a drug or contact with a potentially toxic material), by a 
change in environment (e.g., nutrient level, temperature, passage of time) or by a change 
in condition or cell state (e.g., disease state, malignancy, site-directed mutation, gene 
knockouts) of the cell, tissue or organism from which the sample originated. The proteins 
identified in such a screen can function as markers for the changed state. For example, 
comparisons of protein expression profiles of normal and malignant cells can result in the 
identification of proteins whose presence or absence is characteristic and diagnostic of the 
malignancy. 

In an exemplary embodiment, the methods herein can be employed to screen for 
changes in the expression or state of enzymatic activity of specific proteins. These 
changes may be induced by a variety of chemicals, including pharmaceutical agonists or 
antagonists, or potentially harmful or toxic materials. The knowledge of such changes may 
be useful for diagnosing enzyme-based diseases and for investigating complex regulatory 
networks in cells. 

The methods herein can also be used to implement a variety of clinical and 
diagnostic analyses to detect the presence, absence, deficiency or excess of a given 
protein or protein function in a biological fluid (e.g., blood), or in cells or tissue. The 
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m e,h 0d is p articularty usefu , in (he analysjs of comp(ex mfcrtures ^ proteins 
contatrang 5 or more distinct proteins or protein functions. 

for the ^ 7 n,iVe me,h ° d emPl0VS afflni ^ abeled P'°«n -active reagents that allow 
c he elecve tsolation of peptide foments or the products of reaction with a given 
prate* (e.g., praducts of enzymabc reaction, from complex mixtures. The isolated peptide 
fragments or reacbon products are characterise of the presence of a protein or the 

ated pep des or reaCon products ara characterized by mass spectrometric (MS) 
echmques. In particular, the sequence of isolated peptides can be determined using 
* MS <MS "> « * ■» of sequence database searchin 

techntques the ptotein f rom which the sequenced peptide originated can be identified. 
The reagents also provide for differentia, isotopic labeling of the isolated peptides or 
reason products which facilitates quantitative determination by mass specbomeby of the 
^amounts of proteins in different sample, Also, theuse of diftela,,yi«. 

~ T 38 ' n,emal fedli,ateS "~ e —*«'« * <•» absolute 

amounts of one or more prateins or raacbon products present in the sample. 

In genera,, the affinity labeled protein reactive reagents of this invention have thrae 

,r label (A) covaien,iy ,inked ,o a — ^ - <™> 

A-L-PRG 

ZtlT r differen,ia " y iS0,0PiCa " y **• b * . of one or mora 

atoms , ^ hnker ^ . stab , e ^ ^ ^ 

subsbtuted with deuteriums ore" with C". 

or non ^ ' " " m ° leCU ' ar hand,B * at binds covalenby 

on-covalenuy, to a capture reagent ,CR, Binding to CR fadiitates isolabon of 

pepbdes, substrates or raacbon products tagged or ,abe,ed with A. In specific 

embodiments, A is a strepavidin or avidinn. After affinfiy isolation of affinity tagged 

materials, some of which may be isotopicaily labeled, the interaction between A and the 



5 



WO 00/11208 



PCT/US99/19415 , 



capture reagent is disrupted or broken to allow MS analysis of the isolated materials. The 
affinity label may be displaced from the capture reagent by addition of displacing ligand, 
which may be free A or a derivative of A, or by changing solvent (e.g., solvent type or pH) 
or temperature conditions or the linker may be cleaved chemically, enzymatically, thermally 
or photochemically to release the isolated materials for MS analysis. 

Two types of PRG groups are specifically provided herein: (a) those groups that 
selectively react with a protein functional group to form a covalent or non-covalent bond 
tagging the protein at specific sites, and (b) those that are transformed by action of the 
protein, e.g., that are substrates for an enzyme. In specific embodiments, PRG is a group 
having specific reactivity for certain protein groups, such as specificity for sulfhydryl groups, 
and is useful in general for selectively tagging proteins in complex mixtures. A sulfhydryl 
specific reagent tags proteins containing cysteine. In other specific embodiments, PRG is 
an enzyme substrate that is selectively cleaved (leaving A-L) or modified (giving A-L-PRG') 
by the action of an enzyme of interest. 

Exemplary reagents have the general formula: 

A.B 1 -X 1 -(CH 2 ) n .[X 2 -(CH 2 ) m ] x -X 3 -(CH 2 ) p -X 4 -B 2 -PRG 

where: 

A is the affinity label; 

PRG is the protein reactive group; 

X 1 , X 2 , X 3 and X 4 , independently of one another, and X 2 independently of other X 2 in 
the linker group, can be selected from O, S, NH, NR, NRR' + , CO, COO, COS, S-S, 
SO, S0 2 , CO-NR', CS-NR', Si-O, aryl or diaryl groups or X 1 -X 4 may be absent, but 
preferably at least one of X 1 -X 4 is present; 

B 1 and B 2 , independently of one another, are optional moieties that can faciliate 
bonding of the A or PRG group to the linker or prevent undesired cleavage of those 
groups from the linker and can be selected, for example, from COO, CO, CO-NR', 
CS-NR' and may contain one or more CH 2 groups alone or in combination with other 
groups, e.g.( CH 2 ) p -CONR', (CH 2 ) q -CS-NR\ or (CH 2 ) q ; 
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n, m. p and q are whole numbers that can have values from 0 to about 100 

preferably one of n, m, p or q is not 0 and x is also a whole number that oan range 

from 0 to about 1 00 where the sum of n + xm +P+ q is preferably less than about 100 

and more preferably less than about 20; 

R is an alkyl, alkenyl. alkynyl, alkoxy or aryl group; and 

R' is a hydrogen, an alkyl, alkenyl, alkynyl, alkoxy or aryl group. 

( ci C6 > IT T ? tne CH * 9roups 0,the linker can be °™ »i, h — 

or ? xy 9roups ' an aryl 9raup or can be subs,ituted •* 

groups that promote .omzabon, such as acidic or basic groups or groups carrying 

ZT T T " ne8atiVe ^ ° f m ° re Sin9le b °" dS «*-*» groups 
.the nker can be replaced with a double or a Iriple bond. Preferred R and R' a J 

alkenyl, alkyny, or alkoxy groups are small having 1 to about 6 carbon atoms. ' 

One or more of the atoms in the linker can be substituted with a stable isotope to 
generate one or more substantia,,, chemically identical, bu, isotopically distinguishable 
ea ents. For example, one or more hydrogens in the ,inker can be subs« ut ed with 
deutenum to generate isotopically heavy reagents. 

In an exemplary embodiment the linker contains groups that can be cleaved to 

27, T* ,a9 ' " 3 CleaVable ' inker STOUP " empto ^ d ' » is cleaved after 

TZ k ' SUbStra,eS ° r rea ° ,i0n Pr ° dUC,S have bee " "-"g *e 

ami t 7T ^ WS ° aS6 ' "» iSOtopi ' ^ »• 

bound to the pnotein, peptide, substrate or reaction product. 

diamine ink . ergr0UPS inC ' U ° e am ° n9 0 *"* """diamines, polyether 

a, en I h ,' ne5, ^ ***** di » -* «»ers, aly or 

ke y cha,ns (straight chain or branched and portions of which may be cyclic,, a J diary, 

~ *" 9r ° UPS k8re ~ "~ 1 « * ™ (e.g. N O 
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In one aspect, the invention provides a mass spectrometric method for identification 
and quantitation of one or more proteins in a complex mixture which employs affinity 
labeled reagents in which the PRG is a group that selectively reacts with certain groups 
that are typically found in peptides (e.g.,sulfhydryl, amino, carboxy, homoserine lactone 
groups). One or more affinity labeled reagents with different PRG groups are introduced 
into a mixture containing proteins and the reagents react with certain proteins to tag them 
with the affinity label. It may be necessary to pretreat the protein mixture to reduce 
disulfide bonds or otherwise facilitate affinity labeling. After reaction with the affinity labeled 
reagents, proteins in the complex mixture are cleaved, e.g., enzymatically, into a number 
of peptides. This digestion step may not be necessary, if the proteins are relatively small. 
Peptides that remain tagged with the affinity label are isolated by an affinity isolation 
method, e.g., affinity chromatography, via their selective binding to the CR. Isolated 
peptides are released from the CR by displacement of A or cleavage of the linker, and 
released materials are analyzed by liquid chromatography/mass spectrometry (LC/MS). 
The sequence of one or more tagged peptides is then determined by MS" techniques. At 
least one peptide sequence derived from a protein will be characteristic of that protein and 
be indicative of its presence in the mixture. Thus, the sequences of the peptides typically 
provide sufficient information to identify one or more proteins present in a mixture. 

Quantitative relative amounts of proteins in one or more different samples containing 
protein mixtures (e.g., biological fluids, cell or tissue lysates, etc.) can be determined using 
chemically identical, affinity tagged and differentially isotopically labeled reagents to affinity 
tag and differentially isotopically label proteins in the different samples. In this method, each 
sample to be compared is treated with a different isotopically labeled reagent to tag certain 
proteins therein with the affinity label. The treated samples are then combined, preferably in 
equal amounts, and the proteins in the combined sample are enzymatically digested, if 
necessary, to generate peptides. Some of the peptides are affinity tagged and in addition 
tagged peptides originating from different samples are differentially isotopically labeled. As 
described above, affinity labeled peptides are isolated, released from the capture reagent 
and analyzed by (LC/MS). Peptides characteristic of their protein origin are sequenced using 
MS" techniques allowing identification of proteins in the samples. The relative amounts of a 
given protein in each sample is determined by comparing relative abundance of the ions 
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generated from any differentially labeled peptides originating from that protein. The method 
can be used to assess relative amounts of known proteins in different samples Further 
smce the method does not require any prior knowledge of the type of proteins that may be 
present ,n the samples, it can be used to identify proteins which are present at different 
levels ,n the samples examined. More specifically, the method can be applied to screen for 
and tden.ify proteins which exhibit differential express in cells, tissue or biological fluids It is 
a so possible to determine the absolute amounts of specific proteins in a complex mixture. In 
th,s case, a known amount of internal standard, one for each specific protein in the mixture to 
be quantified, is added to the sample to be analyzed. The internal standard is an affinity 
tagged peptide that is identical in chemical structure to the affinity tagged peptide to be 
quantified except that the internal standara is differentially isotopically labeled, either in the 
pept.de or ,n the affinity tag portion, to distinguish it from the affinity tagged peptide to be 
quanbfied. The interna! standard can be provided in the sample to be analyzed in other 
ways. For example, a specific protein or set of proteins can be chemically tagged with an 
rsotopically-labeled affinity tagging reagent. A known amount of this materia, can be added 
to the sample to be analyzed. Alternatively, a specific protein or set of proteins may be 
labeled with heavy atom isotopes and then derivatized with an affinity tagging reagent. 

Also, it is possible ot quantify the levels of specific proteins in multiple samples in a 
Single analysis (multiplexing,. In this case, affinity tagging reagent s used to derivatize 
proteins present in different affinity tagged peptides from different samples can be selectively 
quantified by mass spectrometry. 

In this aspect of the invention, the method provides for quantitative measurement of 
specfic proteins in biological fluids, cells ortissues and can be applied to determine giobal 
protetn expression profiles in different cells and tissues. The same genera, strategy can be 
broadened to achieve the proteome-wide. qualitative and quantitative analysis of the state of 
mod,ficabon of proteins, by employing affinity reagents with differing specificity for reaction 
w,th proteins. The method and reagents of this invention can be used to identify low 
abundance proteins in complex mixlures and can be used to selectively analyze specific 
groups or classes of proteins such as membrane or cell surface proteins, or proteins 
contamed within organelles, sub-cellular fractions, or biochemical fractions such as 



9 



WO 00/11208 



PCTAJS99/19415 .. 



immunoprecipitates. Further, these methods can be applied to analyze differences in 
expressed proteins in different cell states. For example, the methods and reagents herein 
can be employed in diagnostic assays for the detection of the presence or the absence of 
one or more proteins indicative of a disease state, such as cancer. 

In a second aspect, the invention provides a MS method for detection of the presence 
or absence of a protein function, e.g., an enzyme activity, in a sample. The method can also 
be employed to detect a deficiency or excess (over normal levels) of protein function in a 
sample. Samples that can be analyzed include various biological fluids and materials, 
including tissue and cells. In this case, the PRG of the affinity labeled reagent is a substrate 
for the enzyme of interest. Affinity labeled substrates are provided for each enzyme of 
interest and are introduced into a sample where they react to generate affinity labeled 
products, if the enzyme of interest is present in the sample. Products or unreacted substrate 
that are tagged with the affinity label are isolated by an affinity isolation method, e.g., affinity 
chromatography, via their selective binding to the CR. The isolated tagged substrates and 
products are analyzed by mass spectrometry. Affinity labeled products include those in 
which the substrate is entirely cleaved from the linker or in which the substrate is modified by 
reaction with a protein of interest. Detection of the affinity-labeled product indicates the 
protein function is present in the sample. Detection of little or no affinity labeled product 
indicates deficiency or absence, respectively, of the protein function in the sample. 

The amount of selected protein, e.g., measured in terms of enzyme activity, present in 
a sample can be measured by introducing a known amount of an internal standard which is 
an isotopically labeled analog of the expected product of the enzymatic reaction of the 
reagent substrate. The internal standard is substantially chemically identical to the expected 
enzymatic reaction product, but is isotopically distinguishable therefrom. The level of protein 
function (e.g., enzymatic activity) in a given sample can be compared with activity levels in 
other samples or controls (either negative or positive controls). The procedure therefore can 
detect the presence, absence, deficiency or excess of a protein function in a sample. The 
method is capable of quantifying the velocity of an enzymatic reaction since it enables the 
amount of product formed over a known time period to be measured. This method can be 
multiplexed, by simultaneous use of a plurality of affinity labeled substrates selective for 
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interna, standards for expected products, to analyze for a piuraiity of profein functions in a 
single sample. 



DETAILED DESCRIPTION OF THE INVENTION 

which r nr of ,his inven,ion emp,oy •** ,a " ed pro,ein ~<»» - 

*ch the affinity tag ,s covalently attached to a protein reacttve group by a linker. The linker 
can be ,so,op,ca„ y labeled to generate pairs or sets of reagents that are substantially 
c^micalty identical, but which are distinguishable by mass. For example a pair of reagents, 
on of wh,ch ,s ,so,opica„y heavy and the other of which is isotopically llght can be employed 
for.be companson of two samples one of which may be a reference sample containing one 

0 more known proteins in known amounts. For example, any one or more of the hydrogen 
nitrogen, oxygen or sulfur atoms in the linker may be replaced with .heir isotopically stable ' 
isotopes: H »C, IS N, "O, "O or "S. 

^T b>e ! fflnity ,39S SeleC,iVe ' y 6i,her C ° Valen,ly ° r ~alentty and with high 
affintty to a capture reagent <CR). The CR-A interaction or bond should remain intact after 
extensive and multiple washings with a vahety of solutions to remove non-specffically bound 
components. The affinity tag binds minjmally or p „ y M „ - ^ ^ > 

assay system, except CR, and does no. siginftcantly bind to surfaces of reactton vessels 
Any non-speciftc interaction of the affinity tag with other components or surfaces should be 
d sruptod y muttiple washes that leave CR-A intact Further, i, mus. be possible to disn.pt 
he in eractton of A and CR to release peptides, substrates or reaction producta for example 
by addttion ofadisplacingligand or by changing .he temperature or solvent condlT" 
Pre ferably. neither CR or A react chemically wtth other components in the assay system and 
b^oups .houlc be chemically stable over the time period of an assay or expeLnt 

T e alt i n' r V d06S UnderS ° PeP,We - |ike fr3 ~ to " ^ <M8r analysis 

la n IT 'l Preferab ' y S0 ' Uble ^ ^ liqUid 10 bS a « a "< *• Should 
remain soluble ,n .he sample liquid even .hough attached to an insoluble resin such as 

Agarose. In the case of CR term soluble means that CR is sufficiently hydrated or otoerwise 

1 T ^ " taC "° nS Pr0Periy bindi " 9 ,0 A ' CR - a*—* -™a.es 
should no. be present in me sample to be analyzed, excep. when added to capture A 
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Examples of A and CR pairs include: 

d-biotin of structurally modified biotin-based reagents, including d-iminobiotin, which 
bind to proteins of the avidin/streptavidin, which may, for example, be used in the 
forms of strepavidin-Agarose, oligomeric-avidin-Agarose, or monomeric-avidin- 
Agarose; 

any 1 ,2-diol, such as 1 ,2-dihydroxyethane (HO-CH 2 -CH 2 -OH), and other 1,2- 
dihyroxyalkanes including those of cyclic atkanes, e.g., 1 ,2-dihydroxycyclohexane 
which bind to an alkyl or aryl boronic acid or boronic acid esters , such as phenyl- 
B(OH) 2 or hexyl-B(OEthyl) 2 which may be attached via the alkyl or aryl group to a solid 
support material, such as Agarose; 

maltose which binds to maltose binding protein (as well as any other sugar/sugar 
binding protein pair or more generally to any ligand/ligand binding protein pairs that 
has properties discussed above); 

a hapten, such as dinitrophenyl group, for any antibody where the hapten binds to an 
anti-hapten antibody that recognizes the hapten, for example the dinitrophenyl group 
will bind to an anti-dinitrophenyl-IgG; 

a ligand which binds to a transition metal, for example, an oligomeric histidine will bind 
to Ni(ll), the transition metal CR may be used in the form of a resin bound chelated 
transition metal, such as nitrilotriacetic acid-chelated Ni(ll) or iminodiacetic acid- 
chelated Ni(ll); 

glutathione which binds to glutathione-S-transferase. 

In general, any A-CR pair commonly used for affinity enrichment which meets the 
suitability criteria discussed above. Biotin and biotin-based affinity tags are preferred. Of 
particular interest are structurally modified biotins, such as d-iminobiotin, which will elute from 
avidin or strepavidin columns under solvent conditions compatible with ESI-MS analysis, 
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such as d„u,e acids containing 1 0-20% organic solvent. It is expected that d-iminobio.fr, 
togged compounds wil, elute in solvent, below pH 4. d-iminobiofin tagged protein reactive 
reagents can be synthesized by methods described herein for the corresponding biotin 
tagged reagents. 

A displacement ligand. DL. is optionally used to displace A from CR. Suitable DLs are 

«ZZ " T SamP ' eS Un ' eSS add6d ° L 5h ° Uld be ChemiCa '^ really 
a ,e ,n the sample to be analyzed and should no, react with or bind to components (other 

thanCR) rnsamples or bind non-specifioally ,o reaction vessel walls. DL preferably does 
no. undergo pe P ,ide-,ike fragmentation during MS ana,ysis, and its presence in sampie 
should no. significantly suppress .he ionization of tagged peptide, substrate or reacfion 
product conjugates. 

DL itself preferably is minimally ionized during mass spectrometry analysis and me 
format™ of rons composed of DL dusters is preferably minimal. The selection of DL 
depends upon the A and CR groups that are employed. In genera,. DL is selected to' 
drsplace A from CR in a reasonable time scale, a, most within a week of tts addition but 
more preferably within a few minu.es or up to an hour. The affinity of DL for CR should be 
comparable or stronger than .he affinity of the tagged compounds containing A for CR 
Furthermore. DL should be soluble in the solvent used during the elution of tagged 
compounds containing A from CR. DL preferably is free A or a derivative or structural 
mo , fiction of A. Examp.es of DL include. d-bio.in or d-biotin derivatives, particularly those 
con.a,n,ng groups that suppress cluster formation or suppress ionizafion in MS. 

The linker group (L) should be so|ub|e jn ^ ^ ^ ^ ^ ^ 
should be sfeble with respect to chemical reaction, e.g.. substantially chemically Inert wim 

zrr o, ,rr amp,e as we " as a and cr ^ ^ «*« - a *i 

frtmCR r T SPeCi " C in,eraC,i ° n ° f " W ' ,h CR ° r Wi » "» *— ofA 

from CR by a d.splacng ligand or by a change in temperature or solvent. The linker should 
b nd mlnima „ preferaWy no( a , a „ (o o(her gaam , n ^ (o ^ ^ 

surfaces or CR. Any non-specific interactions of the linker should be bmken after multiple 
washes whrch leave .he A-CR complex infect. Linkens preferably do not undergo peptide* 
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fragmentation during (MS) n analysis. At least some of the atoms in the linker groups should 
be readily replaceable with stable heavy-atom isotopes. The linker preferably contains 
groups or moieties that facilitate ionization of the affinity tagged reagents, peptides, 
substrates or reaction products. 

To promote ionization, the linker may contain acidic or basic groups, e.g., COOH, 
S0 3 H, primary, secondary or tertiary amino groups, nitrogen-heterocycles, ethers, or 
combinations of these groups. The linker may also contain groups having a permanent 
charge, e.g., phosphonium groups, quaternary ammonium groups, sulfonium groups, 
chelated metal ions, tetralky or tetraryl borate or stable carbanions. 

The covalent bond of the linker to A or PRG should typically not be unintentionally 
cleaved by chemical or enzymatic reactions during the assay. In some cases it may be 
desirable to cleave the linker from the affinity tag A or from the PRG, for example to facilitate 
release from an affinity column. Thus, the linker can be cleavable, for example, by chemical, 
thermal or photochemical reaction. Photocleavable groups in the linker may include the 1-(2- 
nitrophenyl)-ethyl group. Thermally labile linkers may, for example, be a double-stranded 
duplex formed from two complementary strands of nucleic acid, a strand of a nucleic acid 
with a complementary strand of a peptide nucelic acid, or two complementary peptide 
nucelic acid strands which will dissociate upon heating. Cleavable linkers also include those 
having disulfide bonds, acid or base labile groups, including among others, diarylmethyl or 
trimethylarylmethyl groups, silyl ethers, carbamates, oxyesters, thiesters, thionoesters, and 
a-fluorinated amides and esters. Enzymatically cleavable linkers can contain, for example, 
protease-sensitive amides or esters, p-lactamase-sensitive p-lactam analogs and linkers that 
are nuclease-cleavable, or glycosidase-cleavable. 

The protein reactive group (PRG) can be a group that selectively reacts with certain 
protein functional groups or is a substrate of an enzyme of interest. Any selectively reactive 
protein reactive group should react with a functional group of interest that is present in at 
least a portion of the proteins in a sample. Reaction of PRG with functional groups on the 
protein should occur under conditions that do not lead to substantial degradation of the 
compounds in the sample to be analyzed. Examples of selectively reactive PRGs suitable 
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for use in the affinity tagged reagents of this invenfon, include those which react with 
sundry, groups to tag proteins containing cysteine, those that react with amino groups 
carboxyiate groups, ester groups, phosphate reactive groups, and aldehyde and/or ketone 
reacbve groups or, after fragmentation with CNBr, with homoserine lactone. 

a ~, r„ T r°' r' iVe 9r ° UPS inC ' Ude eP ° XideS ' a - ha,03Cyl 9roup ' ni,riles ' su "<™ted alky, or 

utl h Th ma ' eimldeS - Amin ° reaCtiVe 9TOUPS " a9 amin ° 9r0UpS « - 
su fony, hal,des, .socyana.es, isothiocyanantes, active esters, including tetrafluoropheny, 

esters, and N-hydroxysuccinimidyi esters, acid halides, and acid anyhydrides. In addftion 

nZbh 9r ° UPS inC ' Ude aWehydeS " ket ° nes in ,he presence or absence 0f NaBH < - 

3* 



Carboxylic acid reactive groups include amines or alcohols in the presence of a 
coup ng agent such as dicyoiohexylcarbodiimida. or 2,3,5,6,e«ra fl uoropheny, trffluoroacetete 
a d „ the presence or absence of a coupling catalyst such as ^imethylaminopyndine; and 
transit™ metal-diamine complexes including Cu(ll)phenan.hroline 



lactone 



Ester reactive groups include amines which, for example, react with homoserine 



Fed,,, oTZuf hTTT SraUPS inC ' Ude Che ' a,ed me,a ' Where ' he me,al is ' ** — »* 
Fe(,ll) or Ga(lll), chelated to. for example, nitrilotriacetiac acid or iminodiacetic acid. 

Aldehyde or ketone reactive groups include amine plus NaBH. or NaCNBH, or these 
reagents after ftrs, treating a carbohydrate with periodate to generate an aktehyde or ketone. 

in,eres, P l G vT UPS ^ "* SUbS ' ra,eS * 3 " ^ °< mie ^ Th ° of 
, teres, may, for exampte, be one tha, is associated witn a disease state or birft, defect or 

2 7 " rr e ' y , m&diCai PUnP0S6S - EnZyme S ~ s °< ««* «» « 

wfth *e methods of this invention include, add phosphatase, alkaline phosphatase, alanine 

am ^transferase, amylase, angiotensin converting enzyme, aspartate aminotransferase, 

creatme k.nase, gamma-glutamyltransferase, lipase, lactate dehydrogenase and 
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glucose-6-phosphate dehydrogenase which are currently routinely assayed by other 
methods. 

The requirements discussed above for A, L, PRG, extend to the corresponding to the 
segments of A-L-PRG and the reaction products generated with this reagent. 

Internal standards, which are appropriately isotopically labelled, may be employed in 
the methods of this invention to measure absolute quantitative amounts of proteins in 
samples. Internal standards are of particular use in assays intended to quantitate affinity 
tagged products of enzymatic reactions. In this application, the internal standard is 
chemically identical to the tagged enzymatic product generated by the action of the enzyme 
on the affinity tagged enzyme substrate, but carries isotope labels which may include 2 H, 13 C, 
15 N, 17 0, 18 0, or 34 S, that allow it to be independently detected by MS techniques. Internal 
standards for use in method herein to quantitative one or several proteins in a sample are 
prepared by reaction of affinity labeled protein reactive reagents with a known protein to 
generate the affinity tagged peptides generated from digestion of the tagged protein. Affinity 
tagged peptides internal standards are substantially chemically identical to the corresponding 
affinity tagged peptides generated from digestion of affinity tagged protein, except that they 
are differentially isotopically labeled to allow their independent detection by MS techniques. 

The method of this invention can also be applied to determine the relative quantities of 
one or more proteins in two or more protein samples, the proteins in each sample are 
reacted with affinity tagging reagents which are substantially chemically identical but 
differentially isotopically labeled. The samples are combined and processed as one. The 
relative quantity of each taggged peptide which reflects the relative quantity of the protein 
from which the peptide originates is determined by the measurement of the respective 
isotope peaks by mass spectrometry. 

The methods of this invention can be applied to the analysis or comparison of multiple 
different samples. Samples that can be analyzed by methods of this invention include cell 
homogenates; cell fractions; biological fluids including urine, blood, and cerebrospinal fluid; 
tissue homogenates; tears; feces; saliva; lavage fluids such as lung or peritoneal lavages; 
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mixtures of biological molecules including proteins, lipids, cerbohydretes end nucleic 
eads generated by partial or complete fractionation of cell or tissue homogenates. 

The methods of this invention employ MS and (MS)" methods. While a variety of MS 
and (MS)" are availabel and may be used in these methods, Matrix Assisted Laser 
Desorption Ionization MS (MALDI/MS) and EleOospray Ionization MS (ESI/MS) methods 
are preferred. 



Quantitative proteome analysis 

This method is schematically IHustrated in Scheme 1 using a biotin labeled sulfhydryi-reacave 
reagen. for quantitative protein proffle measurements in a sample protein mixture and a 
reference protein mixture. The method comprises the following steps: 

Reduction. Disulfide bonds of proteins in the sample and referent* mixtures are 
reduced to free SH groups. The preferred reducing agent is tri-n-butylphosphine which is 
used under standard conditions. Al.erna.ive reducing agen.s Mud. mercaptoethylamine 
and drthiothreitol. „ required, this reaction can be performed in the presence of solubilizing 
agents including high concentrations of urea and detergents to maintain pmtein solubility 
The reference and sample protein mixtures to be compared are processed separately 
applying identical reaction conditions; 

Derivatization of SH groups with an affinity tag. Free SH groups are derivatized with 
the btotinylating reagent bio.inyl-iodoacetylamidyK7,10 trioxatridecanediamine the 
synthesis of which is described below. The reagent is prepared in different isotopically 
abeled forms by substitute, of linker atoms with stable isotopes and each sampte is 
denvatized with a different isotopically labeled form of the reagent. Derivatization of SH 
groups is preferably performed under slight* basic conditions (pH 8.5, for 90 min at RT For 
the quan«a.ive, comparative analysis o, two samples, one sample each (tended reference 
sample and sample) are derivatized as illustrated in Scheme 1 with the isotopically light and 
me ^topically heavy torn, of the reagent, respectively. Forthe comparative analysis of 
several samples one sample is designated a reference to which the other samples are 
related to. Typically, the reference sample is labeled with the isotopically heavy reagen. and 
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the experimental samples are labeled with the isotopically light form of the reagent, although 
this choice of reagents is arbitrary. These reactions are also compatible with the presence of 
high concentrations of solubilizing agents; 

Combination of labeled samples. After completion of the affinity tagging reaction 
defined aliquots of the samples labeled with the isotopically different reagents (e.g., heavy 
and light reagents) are combined and all the subsequent steps are performed on the pooled 
samples. Combination of the differentially labeled samples at this early stage of the 
procedure eliminates variability due to subsequent reactions and manipulations. Preferably 
equal amounts of each sample are combined; 

Removal of excess affinity tagged reagent. Excess reagent is adsorbed, for example, 
by adding an excess of SH-containing beads to the reaction mixture after protein SH groups 
are completely derivatized. Beads are added to the solution to achieve about a 5-fold molar 
excess of SH groups over the reagent added and incubated for 30 min at RT. After the 
reaction the beads are be removed by centrifugation; 

Protein digestion. The proteins in the sample mixture are digested, typically with 
trypsin. Alternative proteases are also compatible with the procedure as in fact are chemical 
fragmentation procedures. In cases in which the preceding steps were performed in the 
presence of high concentrations of denaturing solubilizing agents the sample mixture are 
diluted until the denaturant concentration is compatible with the activity of the proteases 
used. This step may be omitting in the analysis of small proteins; 

Affinity isolation of the affinity tagged peptides by interaction with a capture reagent. 
The biotinylated peptides are isolated on avidin-agarose. After digestion the pH of the 
peptide samples is lowered to 6.5 and the biotinylated peptides are immobilized on beads 
coated with monomeric avidin (Pierce). The beads are extensively washed. The last washing 
solvent includes 10% methanol to remove residual SDS. Biotinylated peptides are eluted 
from avidin-agarose, for example, with 0.3% formic acid at pH 2; 
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Analysis of the isolated, derivatized peptides by M LC-MS»or CE-MS"with data 
dependent fragmentation. Methods and instalment control protocols well-known in the art 
and described, for example, in Ducret et al., 1998; Figeys and Aebersold. 1998; Rgeys et al 
1996; or Haynes et al., 1998 are used. 

In this last step, both the quantity and sequence identity of the proteins from which the 
tagged peptides originated can be determined by automated multistage MS This is 
ach,eved by the operation of the mass spectrometer in a dual mode in which it alternates in 
success^ scans between measuring the relative quantities of peptides eluting from the 
caprflary column and recording the sequence information of selected peptides. Peptides are 
quantified by measuring in the MS mode the relative signal intensities for pairs of peptide 
ions of .dentical sequence that are tagged with the isotopically light or heavy forms of the 
reagent, respectively, and which therefore differ in mass by the mass differential encoded 
wrthm the affinity tagged reagent. Peptide sequence information is automatically generated 
by selecting peptide ions of a particular mass-to-chaige (nvz) ratio for collision-induced 
d,ssoci ation (CID) in the mass spectrome , er Qper ^ g ^ ^ ^ ^ 

997). Gygi, S.P. e. al. (1999), and Gygi , S . P . „ „. (1g9g)) The ^ ^ ^ 
then automatically correlated with sequence databases to identify the pretein from which the 
sequenced peptide originated. Combination of the results generated by MS and Ms- 
analyses of affinity tagged and differentially labeled peptide samples therefore determines 
the relative quantities as well as the sequence identities of the components of pretein 
mixtures in a single, automated operation. 

Results of this applying this method using the biotinylated sulfhydryl reagent and to 
the quantitative analysis of synthetic peptide samples, to the retative quantitation of the 
peptides ,n a pretein digest an the tandem mass spectral analysis of a derivatized peptide 
are shown in Fig. 1 , Table 1 , and Fig. 2, respectively. 

This method can also be practiced using other affinity tags and other pretein reactive 
greups, including amino reactive groups, carboxyl reacuve groups, or groups that react with 
nomosenne lactones. 
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The approach employed herein for quanitative proteome analysis is based on two 
principles. First, a short sequence of contiguous amino acids from a protein (5-25 residues) 
contains sufficient information to uniquely identify that protein. Protein identification by MS n 
is accomplished by correlating the sequence information contained in the CID mass 
spectrum with sequence databases, using sophisticated computer searching algorithms 
(Eng, J. et al. (1994); Mann, M. et al. (1994); Qin, J. et al. (1997); Clauser, K.R. et ali (1995)). 
Second, pairs of identical peptides tagged with the light and heavy affinity tagged reagents, 
respectively, (or in analysis of more than two samples, sets of identical tagged peptides in 
which each set member is differentially isotopically labeled) are chemically identical and 
therefore serve as mutual internal standards for accurate quantitation. The MS 
measurement readily differentiates between peptides originating from different samples, 
representing for example different cell states, because of the difference between isotopically 
distinct reagents attached to the peptides. The ratios between the intensities of the differing 
weight components of these pairs or sets of peaks provide an accurate measure of the 
relative abundance of the peptides (and hence the proteins) in the original cell pools because 
the MS intensity response to a given peptide is independent of the isotopic composition of 
the reagents (De Leenheer, AP. et al (1992). The use of isotopically labeled internal 
standards is standard practice in quantitative mass spectrometry and has been exploited to 
great advantage in, for example, the precise quantitation of drugs and metabolites in bodily 
fluids (De Leenheer, AP. et al. (1992). 

In another illustration of the method, two mixtures consisting of the same six proteins 
at known, but different, concentrations were prepared and analyzed. The protein mixtures 
were labeled, combined and treated as schematically illustrated in Scheme 1. The isolated, 
tagged peptides were quantified and sequenced in a single combined pLC-MS and pLC-MS n 
experiment on an ESI ion trap mass spectrometer. All six proteins were unambiguously 
identified and accurately quantified (Table 2). Multiple tagged peptides were encountered for 
each protein. The differences between the observed and expected quantities for the six 
proteins ranged between 2 and 12%. 

The process is further illustrated for a single peptide pair in Figs. 3A-C. A single scan 
of the mass spectrometer operated in MS mode is shown in Fig. 3A. Four pairs of peptide 
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ions chafed by toe mass differentia, encoded in the affintiy tagged reagent are 
detected ,n <h,s scan and indicated wtih their respective m/z vaiues. The scan shown was 

Z'lTnT h 5 ' 0V6r C ° UrSe °' ^ ° ne - h ° Ur '""matographic elution gradient, more 
*an 1200 such scans were automatically recorded. Fig. 3B shows an expanded view of the 
mass spectoum around the ion pair with m* ratios of 993.8 and 977.7. respectively Co- 

doT 7 " ? ,eCted maSS diffe,Bn,ial ° ,f0Ur Uni ' S POten,ial1 ^ idan,fe *• -s as a pair of 

T2 Til T ta " ed PeP,HeS °' WentiCal *«*»™ < mass «*"— - « a 
charge state oftwo). Fig. 3C shows the reconstructed ion chromatograms forthese two 

^eaes. The relative quantities were determined by integrating the contour of the respect 

peaks. The ra„o (light/heavy) was determined as 0.54 (Table 1). The peaks in the 

reconstructed ion chromatograms appear serrated because in every second scan the mass 

CID mass spectrum, of a selected peptide ion. These CID spectra were used to identify the 
proton rom wh,ch the togged peptides originated. Fig. 4A shows the CID spectrum 
recorded from the peptide ion with n* = 998 < ma rked with an arrow in Fig. 3A). Database 
searing w«h this CD spectrum Identified the protein as giyceraldehydelphosphato 
dehydrogenase (Fig. 4B) which was a member of the protein mixture. 

Several beneficial features of the this method are apparent. First, at least two 
peptides were detected from each protein in the mixture. Therefore, both quantitation and 
pn*e,n identification can be redundant. Second, the identified peptides all contained a, least 
one tagged C yste,nyl residue. The presence of the relative* rare cysteinyl residue in a 

1998T ™T add " i0na ' P ° WerfUl COnStraW f ° r da,ab3Se < S ^ S. e, a,. 

ulir T 9 , and SeleC,iV6 ennChmem °' Peptides signfflcantiy 

reduced the complex,* of toe peptide mixture generated by the concurrent digestion of six 
proto.ns. For this protein mixture, the complexity was reduced from 293 potential tryptic 
peptides to 44 tiyptic peptides containing a. leas, one cysteiny, residue. Fourth the peptide 
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Quantitative analysis of protein expression in different cell states 

The protein reactive affinity reagent strategy was applied to study differences in 
steady-state protein expression in the yeast, S. cerevisiae, in two non-glucose repressed 
states (Table 3). Cells were harvested from yeast growing in log-phase utilizing either 2% 
galactose or 2% ethanol as the carbon source. One-hundred ug of soluble yeast protein 
from each cell state were labeled independently with the isotopically different affinity tagged 
reagents. The labeled samples were combined and subjected to the strategy described in 
Fig. 1 . One fiftieth (the equivalent of approximately 2 ug of protein from each cell state) of 
the sample was analyzed. 

Glucose repression causes large numbers of proteins with metabolic functions 
significant to growth on other carbon sources to be minimally expressed (Ronne, H. (1995; 
Hodges, P.E. et al. (1999)). Growth on galactose or ethanol with no glucose present results 
in the expression of glucose repressed genes. Table 3 presents a selection of 34 yeast 
genes encountered in the analysis, but it contains every known glucose-repressed genes 
that was identified (Mann, M. et al. (1994). Each of these genes would have been minimally 
expressed in yeast grown on glucose. Genes specific to both growth on galactose (GAL1 , 
GAL10) as well as growth on ethanol (ADH2, ACH1) were detected and quantitated. 

The quantitative nature of the method is apparent in the ability to accurately measure 
small changes in relative protein levels. Evidence of the accuracy of the measurements can 
be seen by the excellent agreement found by examining ratios for proteins for which multiple 
peptides were quantified. For example, the five peptides found from PCK1 had a mean ratio 
±95% confidence intervals of 1.57±0.15, and the percent error was <10%. In addition, the 
observed changes fit the expected changes from the literature (Ronne, H. 1995; Hodges, 
P.E. et al. (1999)). Finally, the observed changes are in agreement with the changes in 
staining intensity for these same proteins examined after two-dimensional gel electrophoresis 
(data not shown). 

The alcohol dehydrogenase family of isozymes in yeast facilitates growth on either 
hexose sugars (ADH1) and ethanol (ADH2). The gene ADH2 encodes an enzyme that is 
both glucose- and galactose-repressed and permits a yeast cell to grow entirely on ethanol 
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rr D Xl aW ^ hiChemerS,heTCACyde(Fi9 - 5A '- 

Z Z , 12 reV6rSe reaC " 0n ~ 9 into 

.WMon of , ties isozymes . key ,o carbon utiiization in yeast (Ronne. H. (1995)) The 

accurate, measure differences in gene expression across famiiies o, iso« 
sometimes difBcu t usina cDNA arrav t»^h nl . '™' ls 

eta! moot* tk 7 y techn «' uesbe ^e of cross hybridization (DeRisi, J L 

e. al. (1997),. The method of this invention appiied as iiiustrated in Fig, succeeded in 
measunng gene expression for each isozyme even though ADH1 and ADH2 she e 9 "/ 
am,no acW (88,. nucleotide, seguence simiiarity. This was because the 1 J agged 

Z ht cal e><PreSSed * aPPr0Ximate ' y 2 -'° ld ^ levete «*« ^lactose 

was he canbon source compared with ethanoi. Ethanol-induction of ADH2 expressL 

resulted ,n more lhan 200,o,d increases compared with galactese-induction. 

The results described above illustrate that the method of this invention provides 
duanbtabve analysis of protein mixtures and the identities of the protein <Z2s 
therein in a single, automated operation. components 

complex^ oHhe 0 ! a5 ,H PP ' ied US '" 9 3 5UM,yd ' yl rea * 6 rea98nt *. 
selectively^ isolated ^ "T b6CaUSe ^ -P«- are 

em 1 1 T h 6XamP 3 ' heoretical ,w " c di9est 0,,he en,ire 

(61 3 P n,«e,ns> produces 344,855 peptides, bu, only 30,619 of these peptides contain a 

of the tagged peptides completely eliminates contaminants incompatible witi, MS The 
large amounts (milUgrams, of starting p ro ,ei n lysate and involves some type of enrichment 
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these low abundance proteins. Assays described above, start with about 100 pg of protein 
and used no fractionation techniques. Of this, approximately 1/50 of the protein was 
analyzed in a single pLC-MS n experiment. This system has a limit of detection of 10-20 fmol 
per peptide (Gygi, S.P. et al. (1999)). For this reason, in the assays described which employ 
pLC-MS n only abundant proteins are detected. However, the methods of this invention are 
compatible with any biochemical, immunological or cell biological fractionation methods that 
reduce the mixture complexity and enrich for proteins of low abundance while quantitation is 
maintained. This method can be redundant in both quantitation and identification if multiple 
cysteines are detected. There is a dynamic range associated with the ability of the method 
to quantitate differences in expression levels of affinity tagged peptides which is dependent 
on both the intensity of the peaks corresponding the peptide pair (or set) and the overall 
mixture complexity. In addition, this dynamic range will be different for each type of mass 
spectrometer used. The ion trap was employed in assays described herein because of its 
ability to collect impressive amounts of sequencing information (thousands of proteins can 
potentially be identified) in a data-dependent fashion even though it offers a more limited 
dynamic quantitation range. The dynamic range of the ion trap (based on signal-to-noise 
ratios) varied depending on the signal intensity of the peptide pair and complexity of the 
mixture, but differences of up to 1 00-fold were generally detectable and even larger 
differences could be determined for more abundant peptides. In addition, protein expression 
level changes of more than 100-200-fold still identify those proteins as major potential 
contributors to the phenotypic differences between the two original cell states. The method 
can be extended to include reactivity toward other functional groups. A small percentage of 
proteins (8% for S. cerevisiae) contain no cysteinyl residues and are therefore missed by 
analysis using reagents with sulfhydryl group specificity (i.e.,thiol group specificity). Affinity 
tagged reagents with specificities toward functional groups other than sulfhydryl groups will 
also make cysteine-free proteins susceptible to analysis. 

The methods of this invention can be applied to analysis of low abundance proteins 
and classes of proteins with particular physico-chemical properties including poor solubility, 
large or small size and extreme pi values. 
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The prototypical application of the chemisby and method is the establishment of 
quanMabve profiles of complex protein samples and ultimately total lysates of cells and 
tissues following the preferred method deschbed above. In addition the reagents and 
methods of this invetion have applications which go beyond the determination of protein 
express™ profiles. Such applications include the following: 

affin J PP T n °' amin °- reaC,iVe ° r s "'%«ryl-reac,ive. differentia,* isotopically labeled 
affinity tagged reagents for the quanfitative analysis of proteins in immune precipitated 
compiexes. in the preferred version of this technique protein complexes tarn cells 
presenting different states (e.g.. different states of actuation, different disease states 

T ° f differemia,i0n) are ™«n a specfiic reagent, preferably an ' 

embody. The proteins in the precipitated complex are then denvatized and analyzed as 

Application of amino-reacfive, differentially isotopically iabeled affinity tagged reagents 
to deternrme the snes of induced protein phosphoric, in a preferred vein of mis 
mtftod punfied proteins (e.g.. immunoprecipitated from cells under different stimulatory 

are fragmented and derivateed as described above. Phosphopeptides are 
identified ,n the resuifing peptide mixture by fragmentation in the ion source of the ESI-MS 
.n tmrnen, and their rotative abundances are determined by comparing the ion signal 

2dar te eXPerimen,al SamP ' e * ' he in,enSi,y ° f a " inCluded ' is °« 'a»e,ed 

identirv A rr ac,ive ' d " feren,ia " y iso,opica " y iabe,ed amn ** rea ^ « «— <o 

denwy the Mterm.na, ion series in MS" spec*, ,„ , preferrea version of this appljcatiQn ^ 

Phdes to be analyzed are denvatized w„h a 50:50 mixture of an isotopicaliy h« and 
heavy reagent which is specific for amino groups. Fragmentation of the pepfi es by CD 
fiterefore produce two AMermina, ion series which d«er in mass prec Jy by the mis 

dotali T rea9ent SP6CieS USed ' ™ S aPP ' iCa,ton -duces the dfflculty in 

determining the ammo acid sequence of trie derivatized peptide. 
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Quantitative Analysis of Surface Proteins in Cells and Tissue 

The cell exterior membrane and its associated proteins (cell surface proteins) 
participate in sensing external signals and responding to environmental cues. Changes in 
the abundance of cell surface proteins can reflect a specific cellular state or the ability of a 
cell to respond to its changing environment. Thus, the comprehensive, quantitative 
characterization of the protein components of the cell surface can identify marker proteins or 
constellations of marker proteins characteristic for a particular cellular state, or explain the 
molecular basis for cellular responses to external stimuli. Indeed, changes in expression of a 
number of cell surface receptors such as Her2/neu, erbB, IGFI receptor, and EGF receptor 
have been implicated in carcinogenesis and a current immunological therapeutic approach 
for breast cancer is based on the infusion of an antibody (Herceptin, Genentech, Palo Alto, 
CA) that specifically recognizes Her2/neu receptor. 

Cell surface proteins are also experimentally accessible. Diagnostic assays for cell 
classification and preparative isolation of specific cells by methods such as cell sorting or 
panning are based on cell surface proteins. Thus, differential analysis of cell surface 
proteins between normal and diseased (e.g., cancer) cells can identify important diagnostic 
or therapeutic targets. While the importance of cell surface proteins for diagnosis and 
therapy of cancer has been recognized, membrane proteins have been difficult to analyze. 
Due to their generally poor solubility they tend to be under-represented in standard 2D gel 
electrophoresis patterns and attempts to adapt 2D electrophoresis conditions to the 
separation of membrane proteins have met limited success. The method of this invention 
can overcome the limitations inherent in the traditional techniques. 

The analysis of membrane proteins is challenging because they generally are difficult 
to maintain in solution under conditions that are compatible with high sensitivity analytical 
instruments such as mass spectrometers. The application of the methods of the present 
invention to the analysis of membrane proteins is exemplifed using human T cell lymphoma 
cell line Jurkat for membrane protein labeling and extraction and the well characterized 
human prostate epithelial cell line P69SV40T and two P69SV40T sublines which differ in 
IGF-1 receptor expression by factor of 10 to exemplify quantitative, differential analysis of 
membrane proteins. 
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Jurka. cells are an appropriate modal system baoausa .ha cells ara aasy to grow in 

2ZT and because ,he modula,ion of 0611 surface pro,eins in ~° * «** 

*mu„ and expenmenta, conditions has baan well characterized in T lymphocytes 

emZadted" 3 ? 1 ?*" 3 "" 9 " m ° re « *** ~»-* « 

employed to denvauze ,ys,ne residues and the free N-termini. Water soluble biotinytebng 

6 b^I SUlf °- NHS (N " M,0Xy ~ id *> - — (Sultesuccinim idy,- 

r Pi0rC6 ' ,L)WWCh h3Ve bee " — "elyfor 

labe„ng cel. surface proteins can be employed.. The reaction of NHS esters w«h primary 

amrnes is as, a, neutra, P H values and above and is compare with the present 

organ, solvent such as DMSO or DMF. Bio.inyla.ion of cel, surface proteins from the Jurka. 

^seamed PBS buffer at pH 7.2. Cells (1 x 10 7 ) are washed with PBS buffer to^ 

remove contending serum and other proteins from the culture medium. The cells are 

ZZ nu f" 1' Ce '" m ' feaC,ed Wi ' h 0 5 ^ ° f 3""o-NHS-B,o«in (Pierce. 
Rockford IL) for 30 m,n a, RT. The labeled cells are washed twice with cold PBS to remove 
umeacted b,o,in,a,ing reagent. Biotinyiated cells are soluble a. 5 x 10' ce„s/m, ZT 

argent phase and agueous phase a, 30°C. Poling the phase partitioning, decent 
h se ,s removed from the agueous phase by centefugabon a, 300xg. Phase parbtioning 
foun I ^^—'^^^"^-^-^"e. Also.mistechniguawa 

si * biCarb ° na,e bUffef ' PH 8 ' 5 ' and ™— Potctne-bvpsin 

,s added to d,ges„he proteins a. a concentration o, 12.5 ng/m, for overnight a. 37*C 

21 ' S neUt ? lk6d ^ ^ addi ' i0n 3 ^ ° f Serine W*» and u^tic 

pepbd s are .solated by Ihe avidin a«in«y chromatography techniques. Buted pepbdTs are 

using for example, the Sequest program. 

T antigen has Z T * P69SV4 ° T " W * *" im " ed ^ « 40 

anbgen has been well characterized . This eel, line is immortal but not tumorigenic and 

expresses type 1 insulin like growth factor receptor 0GF-1 R)at2x 10 a receptoZ - A 
nude m,ce . Th,s cell i,„e ,s highly tumorigenic and metastabc and exposes 1.1 x 10' IGF- 
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1R per cell. The relative difference in the abundance of IGF-1 R in the cell lines P69SV40T 
and M12 can be quantitatively determined using methods of this invention adapted for 
application to membrane proteins. Since the number of IGF-1 R for these cell lines has 
already been determined, this well characterized system can provide a reference to validate 
the efficiency of the quantitative methods of this invention 

P69SV40T cells ( 1 x 10 7 ) are biotinylated with an isotopically heavy biotin tagged 
amino reactive reagent and the M12 cells (1 x 10 7 ) are biotinylated with a corresponding 
isotopically light amine reactive biotin tagged amino reactive reagent. IGF-1R is then 
immunoprecipitated from the combined lysate of both cell lines using an antibody against 
human IGF-1 R and the total mass of immunoprecipitated proteins is digested with trypsin. 
Trypsin is then neutralized, e.g., by the addition of inhibitors and tagged peptides are purified 
by biotin-avidin affinity chromatography. The eluted peptides are analyzed by LC-MS and 
LC-MS N for peptide quantitation and identification, respectively, as has been described 
above. Quantitation in this experiment is facilitated by the option to use selective ion 
monitoring in the MS. In this mode only the masses of tagged peptide ions expected to 
derive from IGF-1 R need be monitored. 

The described technique can be applied to compare the differences in relative 
abundance of cell surface proteins between parental prostate cell line (P69SV40T) and M12 
cells to detect and identify those cell surface proteins whose expression level is different in 
the two cell lines and which may be characteristic of the different cell states. Using the 
methods described herein global, relative quantitation of the cell surface proteins in any two 
or more cell lines can be analyzed to detect and identify those cell surface proteins 
characteristic of the different cell states. Results can be independent confirmed using 
procedure such as 1 D or 2D gels, if applicable, or quantitative western blotting to confirm 
quantitation results. 

It is expected that the experimental variability of quantitation of cell surface proteins 
will will be onsiderably better than the accuracy of quantitation achieved by currently 
available cDNA array technology. In addition to relative protein quantity and identity, the 
method can also be used to reveal the orientation of the protein in the membrane, based on 
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the presumption that intact, alive cells will exclude the biotinylating reagent. 

Alternative methods can be applied to enhance the selectivity for tagged peptides 
denved from cell surface proteins. For example, tagged cell surface proteins can be 
trysmized directly on the intact cells to generate tagged peptides, purified and analyzed as 
d,scussed. In addition, traditional cell membrane preparations may be used as an MM step 
to ennch cell surface proteins. These methods can include gentle cell lysis with a dounce 
homogenizer and series of density gradient centrifugations to isolate membrane proteins 
pnor to proteolysis.. This method can provide highly enriched preparations of cell surface 
proteins. Affinity tagged proteins may also be isolated by affinity chromatography prior to 
proteolysis as well as after proteolysis. This chromatography can be perforated in the 
presence of surfactants such as TX-100. NP-40 or Tween-20 to maintain protein solubility 
The sequential application of affinity chromatography steps (one for the intact protein and 
one for the tagged peptide fragments) provides a high degree of selectivity. These 
altremative methods are easily scalable for the detection of low abundance membrane 
proteins and the relative quantity of tagged peptides tagged is maintained through the 
selective enrichment steps. 

In the application of the methods of this invention to cell surface proteins, once the 
tagged proteins are fragmented, the tagged peptides behave no differently from the peptides 
generated from more soluble samples. 

Synthesis of affinity tagged protein reactive reagents that are selective for certain 
proteins groups 



Synthetic routes exemplary affinity tagged reagents suitable for use in the methods of 
th,s mvention are provided in Schemes 2-3 where well-known synthetic techniques are 
employed in synthesis of the non-deuterated and deuterated reagents. . 

BioUnyl-iodoacetylamidyM,7.10 trioxatridecanediamine 4 (Scheme 3) consists of a 
bob. group, a chemically inert spacer of capable of being isotopically .abelled with stable 
•sotopes and a iodoacetamidyl group, respectively. The biotin group is used for affinity 
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enrichment of peptides derivatized with the reagent, the ethylene glycol linker is differentially 
isotopically labeled for mass spectral analysis and the iodoacetamidyl group provides 
specificity of the reagent for sulfhydryl-containing peptides. The reagent an be synthesized 
in an all hydrogen form (isotopically light form) with and with 1-20, and preferably 4-8 
deuterium atoms in the linker (isotopically heavy forms). 

Analysis of Velocities of Multiple Enzymes in Cell Lysates 

Monitoring enzyme functions by biochemical assays is an essential diagnostic tool 
that employs a multitude of analytical techniques including spectrophotometric, fluorometric, 
and radiometric detection of products. However, current methods are difficult to use for 
assaying several enzymes simultaneously in a single sample. Mass spectrometry for 
quantification of a collection of metabolites in biological fluids has emerged as a powerful 
approach for the analysis of birth defects (Morris et al., ig94), but this analytical technique 
has not been developed for the direct analysis of rates of individual enzymatic steps. The 
analytical method described herein for monitoring and quantification of enzymatic activities in 
cell homogenates and other biological samples permits simultaneous (multiplex) monitoring 
of multiple reactions, and can be readily automated. 

A feature of the method of this invention as applied to enzyme assays is the use of 
electrospray ionization mass spectrometry (ESI-MS) (Cole et al., 1997) for the simultaneous 
detection of enzymatic products and chemically identical internal standards, which are 
distinguished by stable isotope (deuterium) labeling. A second feature is the use of affinity 
tagged reagents containing an enzyme substrate which when combined with affinity 
purification provide for facile capture of enzymatic products from crude biological fluids. The 
affinity tagged reagents are designed to contain a target substrate for an enzyme of interest 
that is covalently attached to an affinity tag via a linker. Action of the enzyme of interest on 
the substrate conjugate causes cleavage or other modification that changes its molecular 
mass (Scheme 4). The change of mass is detected by ESI-MS. The linker and and affinity 
tag used preferably facilitate ionization by ESI, block action of other enzymes in the 
biological fluid, and allow highly selective capture from the complex matrix for facile 
purification. 
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suh « T eXamP ' e °' aPPr ° aCh iS ' he deSi9 " and Syn,hesis of amn «* ^me 

T rT 1 2 (S ° heme 5) ,0 *"*«•«"* — V l/sosoma, P-galactosidase 
and N-ace^-D^iucosaminidase. respectively. Deficiency of the former enzyme result, in 
one of the lysosoma, storage diseases. GM,-gangliosidosis, a condition that occurs in the 
population with a frequency of about 1 in 50.000 and leads to eady death of affected 
c , dren.. Deficiency of N-acetyi-R-D-glucosaminidase results in the rare lysosoma, storage 

Int T Syndr ° me ^ a ™ S eXamP ' e h3S b6e " d ™ >" a. a,, 

ll entlety ** 11 ° 2 - 1103 which is inc0 ™ "y reference herein in 

Conjugates 1 and 2 consist of biotin as an affinity teg, which is coupled to sarcosine 
B,o n a ows h,gh,y specie capture of the substrate conjugate through non-covalent binding 

me ?! Th r° bilteed °" aSar ° Se b6adS <Bayer 6t a '- 1 "°>- S ™°»» P^daa an N 
m thy ated am,de linkage to biofin to block the enzyme biotinidase. which is often present in 

toe eliuiar u,ds and couid cause cteavage o, the conjugate molecute during the assay 

Wtar e, a,., 1997). ,„ addition, i, was found tha, biofinyl-sarcosine conjugates can be 

.placed from sueptavidin by addition o, biotin. Tha N-biotinyisarcosine block is linked to a 

polyether d,am,ne. the length of which can be varied to avoW mass/charge overlaps of 

products and internal standards. The linker also allows facile introduction of multiple 

ITT T! (i e " 8 deU ' eriUmS " 5 ^ 4 " 6 ' SCheme 5 » t0 "•»* »• «—* of 
CD2 dCnrM t' d8 "" nker made bV rSaCtin9 °°CH2CH20CH2CH20D wtth 

V. BulL Chern. Soc. Jpn. 1988. «. 2443-2450) and the resulfing dinitrile was reduced to toe 

d dT; : r; T d e d4 - linker was made in ,he sa ™ - — ^ *- - « 

auuuMin CH3CN and catalytic NaOH. 

In addition, the linker is hydrophilic to ensure good water solubility of the substrate 
conjugate and it has basic groups which are efficiently pro.ona.ed by ES, and .hus ensure 
senate detection by mass spectrometry. The .arget carbohydrate substrates are attached 
o he polye toer l.nkar by a p-alanine unit (Scheme 5). The enzymatic product conjugates 3 
a 1 4 are also shown Scheme 5. Conjugates 1 and 2 were prapared as shown in Scheme 5 
Al. reagents were purified to homogeneity by reverse-phase HPLC and characterized by 
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high-field 1 H NMR and ESI-MS. The substrate was linked to the diamine spacer by Michael 
addition of the latter onto the p-acryloylamidophenyl glycoside, (Romanowskaet al., 1994) 
and the intermediate was coupled with the tetrafluorophenyl ester of N-biotinylsarcosine 
(Wilbur etal., 1997). 

The ESI-MS assay of p-galactosidase and N-acetyl-R-D-glucosaminidase is based on 
enzymatic cleavage of the glycosidic bond to release monosaccharide and conjugates 3 and 
4 (mass differences are 162 and 203 Da, respectively). In a typical procedure, 0.2 mM 1 and 
0.3 mM 2 were incubated with sonicated cultured fibroblasts from individual patients with B- 
galactosidase deficiency and with fibroblasts cultured from unaffected people. After 
incubation, labeled internal standards 5 and 6 were added, and the biotinylated components 
were captured on streptavidin-agarose beads. Quantitative strepavidin capture efficiency 
from a cell homogenate was observed with model reagents. After purification by multiple 
washings to remove nonspecifically bound components, the biotinylated products were 
released by free biotin, and the eluant was analyzed by ESI-MS. About 85% release of the 
biotinylated products was observed after incubation with excess biotin for 90 min. A blank 
was obtained by quenching the assay with all components present at time zero. 

A typical procedure, cell protein (75 ug) in 15 pL of water was added to 15 pL buffer 
(0.1 M Na citrate, pH 4.25) containing 2 (0.3 mM) and 1 (0.2 mM, added 5 h after addition of 
cell protein). After incubation for 5.5 h at 37 °C, the reaction was quenched by addition of 
200 pL of 0.2 M glycine carbonate buffer, pH 10.3, and 5 and 6 (1 nmol each) were added. 
After centrifugation to remove cell debris, the supernatant was loaded onto a bed of 
streptavidin-agarose (7 nmol biotin binding capacity, Pierce) in a small filtration device (micro 
BioSpin, Bio-Rad). After 5 min, filtration was effected by centrifugation, and the gel bed was 
washed with 0.1% Triton X-100 ( about 1 min incubation, then spin) and then six times with 
purified water (Milli-Q, Millipore). Elution was carried out in 25 pL of 50% methanol containing 
56 nmol of free biotin (1-10 h incubation, then spin). Filtrate was diluted 4-fold with 50% 
methanol/water, and 1 pL was analyzed by ESI-MS. 

The ESI-MS spectrum of the blank (Figure XA)is remarkably simple, showing peaks of 
the (M + H)* ions from reagents 1 and 2 (mlz 843 and 840), internal standards 5 and 6 (mfz 
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68 and 641), and , raC e amounts of product and 4 (mfz 681 and 637). Ions due ,o 
usta. or ,obn also appear in the spectrum but did no, interfere with the analysis. The 

zi ° rf no ? ra,ed praducts in ,he b,ank may be *» to — i 

Z ITT 9 5amPte W ° rk UP ° f ,0 "'""^ "ion of ,ha substrata 
■on ,n tha gas phasa. A MS/MS spectrum of tha (oonjugata 1 + H)> ion a. nvz 843 gava a 

spa .run, o a sample incubated with ce „ homogenate front a healthy India, cleady lows 

1Z T TJT a ' "* ^^'—idase produce 

m/z 637 (Figure XB). Triplioata enzymatic reactions using oalis from a healthy patient yielded 
a P-galactosidase specific activitv of St .>„^„ „ ""y patient yielded 

„,,,„ . V of 51 ± 3 nmol/h/(mg cell protein) and an N-acetyl-c-D- 

* co ammidasa specie activity of 1.4 ± 0.3 nmoWr/mg. Time course studies coined .bat 

11 'IT, veloci,les were bein9 ™- values ob,a ^ ^ «* 

be 1 y indiv, duals ranged from 36 1 4 to 68 ± 3 nmo,/b,mg for p-galactosidase and 09 ± 
0.05 to 2.3 ± 0.4 nmoi/b/mg for N-acetyl-a-D-glucosaminidase. in contrast, very IMe 

ZH f r ° d t 3b0Ve b,a " k ' 9Vel <0 ' 9 * ^ and 0 8 1 0 6 1 observed 

T2T h Pa, ' en,S ^'"a* — •«* were used, wbereas N-acaty,,. 
O amim-dase a«iv«y is ciaady detected (Figure XC). Tbese spectra ware obtained 
-.H 0.75 M g of cei, protein, corresponding to - ,000 fibroblasts. Tbus the ESi-MS method 
has very high sensitivity for biomedical applications. 

ESI-MS was carried out on a Finnigan LCQ ion trap instrument. Date were collected in 

inZrrr ,o 875 by direct at « *->zzz 

oUained from the ratio of product to interna, slandard ion peax areas (averaged over 30 

can J!" T r0aCh deS ° ribed '° r aSSayh9 « Bym " USi "9 substrate agents and ES,- MS 
n be broad,y a P p,ied. The multiplex technigue can be expanded to assay dozens ormol 

n confirming diagnoses o, rare disorders. The method can be used to measure sev ra 
nzymes simultaneously whan evaluating the rata of chemical flux through a spaoT 
^chemical pathway or for monitonng biochemical signaling pathways. The a^teg. 
-Pta. reagent melhod for isolabon of affinty tagged reaction products and substrates from 
compiex mixtures is technical, simpie and can be readily automated. pa Jl 
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strepavidin is employed. Because of the high sensitivity of the ESI-MS detection employed, 
which requires only sub-microgram quantities of the substrate reagents per assay, the 
synthesis of several hundred substrate reagents on a low-gram scale becomes practical and 
economical. Since most enzyme active sites are exposed to solvent, it is possible to attach 
an affinity tagged linker to most enzyme substrates while preserving enzymatic activity. 
Scheme 6 provides the structures of several additional enzyme substrates, suitable for use in 
this method, indicating by arrows allowable positions for tag attachment sites. Allowable tag 
sites for additional enzyme substrates can be determined by review of X-ray crystal 
structures of enzyme-substrate or enzyme-substrate analog structures. Using a standard 
computer graphics program , available X-ray data and by attaching an extended chain butyl 
group (as a model for the affinity tagged linker) to potential tag attachment sites, suitable 
attachment sites that show there are no enzyme-atoms in van der Waals overlap with the 
model tag can be predicted. 

Analogous methods to those described above can be applied to the analysis of 
enzymes associated with other Sanfillipo Syndromes (A, C and D). SFA is associated with 
heparan sulfamidase, SFC is associated with acetyl-CoA-alpha-glucosaminide N- 
acetyltransferase and SFD is associated with N-acetylglucosamine 6-suIfatase. Exemplary 
affinity tagged enzyme substrate reagents useful in the analysis of these enzymes and the 
diagnosis of these disorders are provided below. The methods can also be applied of the 
diagnosis of Niemann-Pick Type A and B disease by assaying for acid sphingomyelinase 
and to the diagnosis of Krabbe disease by assaying for galactocerebroside beta- 
glacatosidase. These enzymes are currently assayed employing fluorphore-derivatived 
reagents as indicated in Scheme 7. Enzyme substrate reagents for assay of these enzymes 
in the methods herein can be readily prepared by replacement of the fluorophore with an A-L 
group herein. This approach to preparation of affinity tagged enzyme substrates is generally 
applicable to any known fluorophore -derivatized enzyme subtrate or substrate analog. 

Table 4 provides exemplary enzymes that are associates with certain birth defects or 
disease states. These enzymes can be assayed by the methods described herein. 



34 



WO 00/1 1208 



PCT/US99/19415 



EnZyma, ' C ** Carbohydrate-DefiCen, Glycoprotein Syndromes 

of mu ,,H e me ' h0dS rea9entS °' "* inVenfcn be emp '°^ t0 « *• velocfties 
of multiple enzymes pertinent to diagnosis of CDGS diseases. 

CD <5STypelaand.barecausedb y thedenciencyorabsenceofthe enzymes 
phosphomannoisomerase (PMIb) (Type ,b, and phosphomannomutase (PMM2, Ipe ,a) 
wh,ch are pad o, a multistep pathway (Scheme 8, for conversion of glucose to manle , 
Phosphate (Preeze. 1 998 , The monosacchafide ° 

ttuc.ose-6-phosphate, mannose-6-phosphate, and mannose-1-phosphate. These 
monosaccharides can be somewhat difficui, to convert to substrate conjugates because ft is 

notapno.clearwhichatomonthesugarshouidbeconiugatedwfththllLro 
.mparnng enzyme acttvity. P Mlb and PMM2 can, however, be assayed indirectly. 
Mammahan cell microsomes contain dolichoi-P-mannose synthase which catalyLs the 

nvtnZ rT * ^ ™* ^ « "* —»* *• -.hods of this 
makes^DP^n^amK) 8 ^J 0 *' 0 "'" 1 '* 8 ' su ' 3strate - Microbial PMM and the enzyme which 
makes GDP-mannose from OTP and mannose-1-P, GDP-mannose pyrophosphorylase are 
eadriy punned from bacteria and yeast (Glase, 1966. Preiss. 1966,1, these ensea l 
be supplred exogeneously to the enzyme assay. If PM, b activity is de„cien. little orT 
mannose-6-P w», be made when the reason seguence is started by addftio of Zse* P 
W„hou, mannose-6-P. mannose-1-P and GDP-mannose wii, no, be made, and th^s no 
conwated-dolichoi-P-mannose wil, be detected by ES.-MS. Exogenous GTP is supXd as 
a regurrement for the GDP-mannose pyrophosphorylase step, and ATP is omKted a 
mannose^P cannot be made from mannose. To assay PMM2, the reaction set 
initiated with mannose-6-P and PMM? Ho*™ , sequence is 

dolichol-P-mannose. ^ ^ " '° ^ 



manv I T " * '° "^"i™ "a«ng tha, 

many enzymes that operate on carbohydrates attached to doiicho, chains are «o,e ran , o 

TZTT 9 " ^ d0,iCh0 ' Chai " ; ~" " 1 — ondo,icho,sa"ed 
— TT' ' " aPPearS,ha — h -^«aCon,hewa«er-so.ub.e 
carbohy dra ,e portron of the dolicho, conjugate and thus have Itttte or no reguiremen, to bind 
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the dolichol anchor. Based on this, an affinity labeled substrate for the direct assay of 
dolichol-P-mannose synthase and the indirect assay of PMIb and PMM2 is prepared by 
attaching an affinity labeled linker to the non-polar end of a short dolichol, such as the 10- 
carbon dolichol analog citronellol. 

The synthesis of a biotinylated dolichol 10 -substrate conjugate containing a sarcosinyl 
linker (B-S-Do^O-P) is shown in Scheme 9. Protected citronellol (R = f-BuSiMe 2 ) is 
regioselectively oxidized at the terminal alyllic methyl group (McMurry and Kocovsky, 1984), 
and the allylic alcohol is coupled with biotinylsarcosine active ester (R = CH 3 ). The citronellol 
1 -hydroxy group is subsequently deprotected and phosphorylated with POCI 3 (Rush and 
Wachter, 1995). In a parallel synthesis, d 5 -sarcosine t CD 3 NHCD 2 COOH, is used to prepare 
the isotopically labelled (heavy) reagent for use as an internal standard. d 5 - Sarcosine is 
readily prepares form commerically available materials (BrCD 2 COOD and CD 3 NH 2 ) using 
standard synthetic techniques. 

The deuterated internal standard, B-d 5 -S-Dol 10 -P-Mannose, is synthesized 
enzymatically by incubating hen oviduct microsomes with GDP-mannose and the synthetic 
B-d 5 -S-Dol 10 -P substrate conjugate (Rush and Waechler, 1995). An added advantage of the 
B-S-conjugate is that it allows for a facile affinity purification of the microsomal mannosylated 
product by specific capture on agarose-streptavidin beads followed by elution with free biotin. 

This method employing affinity tagged short dolichol analogues is generally 
applicable for assaying other enzymes that operated on dolichol anchored carbohydrates. 
Such an approach is useful for the subsequent identification of enzyme deficiencies present 
in other types of CDGS that have not been yet identified. 

CDGS Type II results from defective GlcNAc transferase II (GlcNAc-T II) which 
transfers GlcNAc from UDP-GlcNAc to the 2-position of a mannose residue in the 
intermediate branched oligosaccharide (the Core Region) in the process of building up the 
disialo-biantennary chain (Scheme 10) (Schachter, 1986, Brockhausen et al, 1989). GlcNAc 
transferase II is one of the six known enzymes that mediate highly regiospecific glycosylation 
of the mannose residues in the Core Region. The Core Region is anchored at the reducing 
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Zl Tl Para9ine ' WhSre aSPara9ine reSWue fe " art ° f the P** chain of 
«- ft«**d protein. The tetter structure unit in the substrate can be replaced by a 

hydrophobe chain wtthou, ,oss of enzyme activity (Kaur e. al. 1991). Thus, the substrate 

connate for CDGS Type „ is assembled by Wing a affinfry-labelled iinKer group 

btZ 9 T T^"^ H0WeV8f ' ' he »* - * s bsbafe can 

exanral ^ hydrophobic chain without loss of enzyme activity (Kauret al, 1991). For 

I 2 1 ?r7r' y aVailaWe °- D — P' ra "^Pheny^iso.h,ocyana«ec a n be coupted 

11 1 t CT and ^ 5 ' 6 - hydrOXy ' S ~ d - I" 

Scheme 11 (Paulsen and Meinjohanns, 1992). Coupling of .he ec.ua.oha, 3-OH wi«h per-O- 

ace ty ,mannosy,-1, ri ch,oroace te mida,e (Paulsen e, al. 1993) wii, provide a disaccharide 

" a miTOr am ° U " ,0f « ««-» a1<heaxia,2-OHg,ou Pt he 

prcduas can S6parated by HpLc After deprotectjon g p 

second e qu ,va,en. of per-O-acefylmannosyl-l-ttchloroacetemidate «o yield .he Core Regton 
oon JU ga,e. Depro.ec.ion of the O-acety, groups yields .he subs.rate jugate for "Ic 
I which can be convened .o the GlcNAc-T „ substrate by erratic gly^ 
transfer us,ng a Tnton X-,00 rabbi, fiver extract, a .action mat has been earned out o a 
preparative scale (Kauretal. 1981). meooutona 

The synthesis of the deuterium labeied derivative needed for .he internal standatf is 

el mT ^ ^ 3 ' abe,ed PEMamiTO b ^hloc k (Oerbere1a,,1 9 g r 
The b,o.,ny,a«ed tnsaccharide is converted to the tetrasaccharide (product of GlcNAc-T „ by 

mcuba on with UDP-GlcNAc and transferase I, ( (Kaur and Hindsgaul. 1991,Tan e. J , 

and u«,„z,ng .he B-S handle for affinity purification of.be enzymatic products. ' 

CDGS Type V 

The lipid-iinked oligosaccharide (LLO, .hat is transferred to the Asn residue of.be 
glycosylated protein is composed of 2 GlcNAc, 9 mannoses, and 3 glucoses. „ has recen.lv 

7 " m ° re 9 ' UCOSe dUnn9 LL ° W0 ~ (Komer e. a.. 199^ 

Since fine transferase that attaches the carbohydrate unit „, LLO to the Asn residue 

oZZT T* *"*-*«** LL0 ' C °<* Type V pafiente have fewer numbers 
of carbohydrate un,ts ateched to glycoproteins, such as transferrin (Komer e. a, 1998, 
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However, the few carbohydrate units that are present are full-length, demonstrating that 
residual glucosyl transfer occurs in type V CDGS patients (Korner et al, 1998). Thus, 
quantification of the rate of Asn glycosylation by ESI-MS would constitute a viable assay of 
CDGS Type V syndrome. 

Synthetic peptides with 3-7 amino acid residues containing the Asn-Xaa-Ser/Thr 
sequence have been shown to be good substrates for glycosylation (Ronin et al„ 1981). 
The strategy for the ESI-MS assay of the oligosaccharide transferase relies on a B-S 
conjugate of an appropriate peptide containing the Asn-Xaa-Ser/Thr sequence (Scheme 13 V 
A heptapeptide, NH 2 -Tyr-Gln-Ser-Asn-Ser-Thr-Met-NH 2 has shown high activity in a 
previous study (Ronin et al, 1981). The peptide is readily available by standard peptide 
synthesis using an in-house automatic synthesizer. The heptapeptide and its 
glycoconjugates can be ionized by ESI to provide stable singly-charged ions. Coupling of 
BS-tetrafluorophenyl ester with NH 2 -Tyr-Gln-Ser-Asn-Ser-Thr-Met-NH 2 will directly yield the 
substrate for the transferase. Several products are expected from the enzymatic 
glycosylation and subsequent modifications of the oligosaccharide antenna. The products 
can be prepared enzymatically by incubating thyroid rough microsomes with BS-Tyr-GIn- 
Ser-Asn-Ser-Thr-Met-NH 2 and Dol-P-Glu (Ronin et al, 1981a), followed by affinity 
purification of the biotinylated products. Product distribution due to different degrees of 
glycosylation can be monitored by ESI-MS, and the major components can be purified by 
HPLC. An analogous procedure using a B-N(CD 3 )CD 2 CO- conjugate is used to prepare 
deuterated internal standards. 

The molecular masses of the ionized substrate conjugates for the set of enzymes 
assayed for CDGS la, lb, II, and V syndromes, as well as products and internal standards 
are compiled in Table 5, which shows that no isobaric overlaps among the (M + H) + species 
occur. The close spacing between the (M + Na) + ion from the Type la.b product and the (M + 
H) + ion of the demannosylated B-(N-C 2 D 5 )-2,2-D 2 -Gly-Dol 10 -P internal standard can be readily 
avoided by adjusting the ESI-MS conditions by addition of Na + ions to generate the gas 
phase ions as Na-adducts. 

All three of the targeted enzymes can be analyzed simultaneously in a single 
biological sample, such as a cell lysate. The PMM2 and PMIb cannot be assayed 
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Clinical Enzymology Assays 

fina . J t flbr ° b,aSt Ce " Pelte * iS ,ha - d °" ice. Sufficient 0.9% NaC, is added «o oh,e a 
fine, p o,e,n concern in the ,ysa«e of - 5mg/ mU (typically 100 mcL) J° ^ 

*e ce,, pe,,e, ,s sonicated ,n ,ce water 5 Hmes for 2 seconds each at nil 

ZTZ T " deteTOined S — , y using ,he BOA t: n , 
(BCA Protein Assay kit, Pierce). 

The total enzyme reaction volume is 20 0 *?n mH Th« « u , 

wn i llrriA . . dieb ,s an a 0.2 mM, respect ve v A 

volume of reaction buffer (t* n Qnn^M ^ p^uvc.y. * 

subs Jel^rT*" fe 8 " 0Wed ,0 PTOCeed 4 5 ° 6 hou ^ ««- which GM1 

SFB. : reaCa0 " 18 a " 0Wed 10 for « "ours. Quenching is as for 

After guenching, the samples are p,aced on ,ce. .nterna, standards are added 

Soo ltr 2 ^ 0 ^ ^ ~ - ^rofcg ed a, 
15.000 rpm for 2 m ,n at room temperature to pe„e, ce„ dehds. Stre ptav,d, -Aga.se 
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beads (Immunopure immobilized streptavidin, Pierce) are placed in a micro bio-spin 
chromatography column (Bio-Rad). Sufficient beads are added to give a total biotin 
binding capacity of 5 nmol (typical binding capacity 100pmol per mcL of beads as 
determined by Pierce). The sample supernatant is transferred to the bio-spin tube and 
allowed to bind for 10 minutes at room temperature. The sample is spun at -3,000 rpm to 
remove excess supernatant, then washed once with 0.01% Triton X-100 and at least five 
times with purified water, spinning the tube in-between to remove solution. For each 
wash, sufficient wash solution is added to fill the bio-spin tube. 

The purified beads are then treated with 30 mcL purified water, followed by 10 
mcL of a 4mM biotin solution. The tubes are capped at the bottom to prevent leakage and 
allowed to incubate at 2-8*C for 2-12 hours. The samples are spun at -3,000 rpm to elute 
the sample into a clean Eppendorf tube. 

The sample is then diluted with 60 mcL of 50% methanol/water and infused into 
the ion-trap mass spectrometer. The ESI-MS spectrum is tuned to reduce non-specific 
cleavage of the samples by first analyzing a blank sample (cell lysate added after reaction 
quench). The infused sample is analyzed by ion chromatogram integration of a 1amu- 
wide window about the (M + H+)+ ions of product and internal standard. 

Results are reported in nmol product formed/hour of incubation/milligrams total 
protein in reaction mixtures 

Clinical Analysis of Patient Samples for GM1 and SFB 

Patient skin fibroblasts were obtained as frozen pellets, and stored at Q20 deg C until 
use. Two GM1 affected samples and six normal controls were analyzed. 

50 mcL of 0.9% NaCI was added to each patient cell pellet. The pellets were 
lysed by sonication in ice water 5x for 2 seconds each at moderate sonication power, 
chilling the microtip in ice water in between sonications. 
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Samples were quantitated by BCA (Pierce) assay as follows: 

Reagent A and B were mixed in 50:1 ratio as described. A protein standard curve 

was prepared using bovine serum albumin as a standard at concentrations of 2 1 05 02 
and 0.05 mg/ mL A portion of the patient sonicates were diluted 1:15 in water and 5 
mcL of each diluted patient sample and standard curve point was added to separate glass 
culture tubes containing 200 mcL water, in duplicate. The samples were then diluted with 
1 mL of the mixed BCA reagent, vortexed to mix, and incubated at 37 deg C for 60 
minutes. The samples were allowed to cool to room temperature, and analyzed against a 
blank containing only 200 mcL water. The samples were analyzed by monitoring 
absorbance at 562nm in polystyrene cuvettes. Average patient absorbance values were 
blank corrected and compared to standards via linear regression. 

The patient protein concentrations were determined to be: 

1. (Affected) 12.2mg/mL, 2. (Normal) 10.8mg/mL, 3. (Normal) 1 1 .9mg/ mL 4 
(Normal) 12.1mg/ mL, 5. (Normal) 10.3mg/ mL, 6. (Normal) 7.79mg/ mL, 7. (Normal) 
1 5.7mg/ mL, 8. (Affected) 1 1 .4mg/ mL 

Incubations were performed in a total of 30 mcL of total volume. The amount of 
reaction buffer (200mM sodium citrate, pH 4.25) added to blank Eppendcrf tubes was the 
deference of the substrate volume (3 mcL of each substrate stock solution, 2mM for 
GM1 and 3mM for SFB, for a total of 6 mcL) plus the volume of cell lysate required to 
equal 75mcg total protein, from 30 mcL. For example, patient 1. incubation mixture 
.mtially contained 3 mcL of SFB substrate solution, 6.14 mcL patient cell lysate, and 
17.86 mcL reaction buffer. The GM1 substrate was added later in the incubation (see 
below). 

Each patient sample was analyzed in triplicate. The reaction mixtures were kept 
on ,ce during preparation, and the reaction was initiated by transfer to a 37 deg C water 
bath. 5.5 hours into the incubation, 3 mcL GM1 substrate was added to each reaction and 
after an additional 0.5 hours the reactions were placed on ice and quenched with 200 mcL 
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of a 200mM glycine-carbonate buffer, pH 10.25. 

The purification and analysis procedures are as described in Clinical 
Enzymology Assay (Typical). 

The resultant enzyme activities, as an average standard deviation nmol 
product/hour incubation/mg total protein: 



B-Gal SFB 



Normals 


RATE 


+/-SD 


Rate 


+/-SD 


Patient 2 


68.0 


2.6 


0.90 


0.05 


Patient 3 


35.5 


3.9 


1.54 


0.38 


Patient 4 


51.1 


2.7 


1.36 


0.26 


Patient 5 


38.8 


8.3 


1.01 


0.12 


Patient 6 


51.4 


9.9 


2.25 


0.36 


Patient 7 


40.9 


3.7 


1.12 


0.20 


Affecteds 










GM t (#1) 


0.9 


0.9 


0.80 


0.21 


GM) (#8) 


0.8 


0.6 


0.70 


0.20 
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The following synthetic method refer to Schemes 14-23 

Synthesis for GM1 -gangliosidosis (beta-D-galactosidase deficiency) 

1. 2.3,5,6-Tetrafluorophenyl trifluoroacetate (1) 25g (0.15mol) 2,3,5 6- 
tetrafluorophenol, 35 mL (0.2mol) trifluoroacetic anhydride and 0.5 mL 
boron trifluoride etherate were refluxed for 18 hours under argon 
atmosphere. Trifluoroacetic anhydride and trifluoroacetic acid were 
removed by distillation at room temperature. The trifluoroacetic 
anhydride fraction was returned to the mixture, and the reaction was 
refluxed for 24 hours. This was repeated twice. After final distillation at 
room temperature, the desired product 1 was distilled at reduced 
pressure (62?C/45mmHg) to produce a colorless liquid (30g, 82%). 1 H- 
NMR. (Ref. Gamper, H. B., Nucl. Acids Res., v21 pp145-150) 

2. Biotin-2,3,5,6-tetrafluorophenyl ester (2) A 2.5g (10.3mmol) quantity of 
d-biotin in 20 mL anhydrous DMF under argon atmosphere was 
warmed to 60?C with stirring to effect dissolution. 1.7 mL (12.5mmol) 
triethylamine was added, followed by 3.4g (12.5mmol) 1. The mixture 
was stirred for 2 hours, after which the solvent was removed by rotary 
evaporation. The resultant semi-solid was triturated with 15 mL ether 
twice to produce a white solid (2.6g, 65%). 1 H-NMR. (Ref. Wilbur, D. 

S., Bioconj. Chem., v8 pp572-584) 

3. N-methylglycylbiotinamide-methyl ester (3) A 2.5g (6.4mmol) quantity 
of biotin tetrafluorophenyl ester in 30 mL anhydrous DMF under argon 
atmosphere was added to a mixture of 1.1g ( 7 .7mmol) N-methylglycine 
methyl ester hydrochloride dissolved in 1 0 mL anhydrous DMF and 
1.25 mL (9.0 mmol) triethylamine. The reaction mixture was stirred at 
room temperature for 2 hours, then the solvent was removed by rotary 
evaporation. The residue was extracted with choroform (2x100 mL), 
washed with water (2x20 mL), and dried with anhydrous sodium 
sulfate. The solvent was removed under vacuum to yield 2.1g (98 %) 

of methyl ester of N-methylglycine biotinamide as an oil. 1 H-NMR. (Ref. 
Wilbur, D. S., Bioconj. Chem., v8 pp572-584) 
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4. N-methylglycylbiotinamide acid (4) N-Methylglycylbiotinamide methyl 
ester was hydrolyzed in a mixture of 31 mL MeOH and 10 mL of 1N 
NaOH at room temperature with stirring for 1 hour. The mixture was 
diluted with 50 mL 50% MeOH/water and neutralized with cation 
exchange resin, hydrogen form (AG MP-50, BioRad). The solution was 
filtered, the resin washed (3x50 mL) with 50% MeOH/water, and the 
solvents removed by rotary evaporation to yield 1 ,6g (90%) of N- 
methylglycylbiotinamide acid as an off-white solid. 1H-NMR. (Ref. 
Wilbur, D. S., Bioconj. Chem., v8 pp572-584) 

5. p-Acrylamidophenyl-?-D-galactopyranoside (5) 40mg (0.15mmol) p- 
aminophenyl ?-D-galactopyranoside was added to 25 mL methanol 
and 200 mcL triethylamine with stirring. The solution was chilled in an 
ice bath. 53.3mg (0.6mmol) acryloyl chloride was dissolved in 5 mL dry 
methylene chloride and added dropwise to the stirred solution over 5 
minutes. The reaction was allowed to return to room temperature, 
followed by 2 hours of stirring. The solution was then treated with 
successive anion and cation exchange resins (AG MP-1 and AG MP- 
50, respectively, BioRad) until neutral pH was obtained with moist pH 
paper. Solvent was removed by rotary evaporation to yield a solid 
(43mg, 90%). 1H-NMR. (Ref. Romanowska, A., Methods Enzymol., 
v242pp90-101) 

6. Michael addition product of 4,7,1 0-trioxa-1,13-tridecanediamine and 5 
(6) 20mg (0.07mmol) 5 was added to a stirred solution of 80mg 
(0.35mmol)4,7,10-trioxa-1,13-tridecanediamine in 5 mL 0.2M sodium 
carbonate, pH 10.5 at 37?C. The reaction was allowed to proceed for 3 
days, after which the solution was neutralized with dilute trifluoroacetic 
acid and purified by reverse-phase HPLC (Vydac C-18 prep-scale 
column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)) 
to give 7.3 mg of product. (Ref. Romanowska, A., Methods Enzymol., 
v242pp90-101) 

7. GM1 substrate conjugate of 4 and 6 (7) A 2.5mg (7.4mcmol) quantity 
of 4 was dissolved in 1 .5 mL anhydrous DMF with stirring, under argon 
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atmosphere. 5 mcL Wethylamine was added, followed by 2 3mg 
(8.8mcmol) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H. Rf 0.5. UV) by briefly d*ng the spotted TLC 
plate w,.h a stream of air. After 25 minutes, the mixture was added to 
3.2mg (5.9mcmol) 6 in 1 mL anhydrous DMF. After 2 hours the 
solvent was removed by vacuum centrifugation and the flnal product 
was punfied by reverse-phase HPLC (Vydac C-18 prep-scale column 
6 mL/m,n. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)) Yield ' 
4.6 mg. (Analogous chemistry, ref. Wilbur, D. S., Bioconj. Chem v8 
PP572-584) 

8. 1 . 2 .10.11-octadeutero-3.6,9.trioxa.1,11-undecanedinitrile (8) 1g 
(9.4mmol) of diethylene glycol was dissolved in 2 mL D20 in a 10 mL 
round bottom flask under argon atmosphere. The D20 was removed by 
rotary evaporation and the process was repeated 4 times. The d-2 
Methylene glycol was addifioned with 25 mL dry benzene, followed by 
16 g (28.2mmol) d-3 aoytonitrile with stirring under argon atmosphera 
After 12h, the solvent was removed under reduced pressure and the 
resultant semisolid was extracted with chloroform (2x5 mL) The 
solvent was removed by rotary evaporation to yield 1 .85g (89%) 
product. (Ref. Ashikaga, K., Bull. Chem. Soc. Jpn., v61 pp2443-2450) 
9. 2.3.11.12-oc.adeutero-4,7.10-trioxa-1.13-tridecanediamine (9) Raney 
mckel (Aldrich) was washed five times with anhydrous methanol by 
inversion and decantation. 50mg of the washed catalyst was placed in 
20 mL anhydrous methanol, followed by 1g (4.6mmol) 8 in a 50 mL 
screw-cap vial fitted with a Teflon-lined rubber septum. The vial 
headspace was flushed for a few min with H2 gas via an 18-gauge 
needle piercing me septum. The cap was screwed on tightly and the 
enfire assembly was charged to 40psi H2 and placed in a hot water 
bath (80 C) for 4 hours, after which the solid catalyst was removed by 
filtrafion and the methanol evaporated. The final product was purified 
by reverse-phase HPLC (Vydac C-18 prep-scale c»lumn. 6 mUmin 
Mobile phase: H20 (0.08%TFAV ACN (0.08%TFA)). Yield 180 mg ' 
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(Ref. Ashikaga, K., Bull. Chem. Soc. Jpn., v61 pp2443-2450) 

10. Deuterated analog of 6 (10) 25mg (0.09mmol) 5 was added to a stirred 
solution of 90mg (0.4mmol) 9 in 5 ml_ 0.2M sodium carbonate, pH 10.5 

at 37?C. The reaction was allowed to proceed for 3 days, after which 
the solution was neutralized with dilute trifluoroacetic acid and purified 
by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. 
Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 6 mg. 

1 1 . Deuterated analog of 7 (1 1 ) A 3mg (8.4mcmol) quantity of 4 was 
dissolved in 0.7 mL anhydrous DMF with stirring, under argon 
atmosphere. 5 mcL triethylamine was added, followed by 2.4mg 
(8.9mcmol) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5, UV) by briefly drying the spotted TLC 
plate with a stream of air. After 25 minutes, the mixture was added to 
6mg (11mcmol) 10 in 1 mL anhydrous DMF. After 2 hours, the solvent 
was removed by vacuum centrifugation and the final product was 
purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 
mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 1 .8 
mg. - 

12. GM1 internal standard conjugate (12) 1.8mg 11 was added to 2 mL 
100mM Tris/10mM MgCI2, pH 7.3 buffer with stirring. 15 units 
recombinant ?-D-galactosidase (Sigma) was added, and after 12 hours 
the mixure was purified by reverse-phase HPLC (Vydac C-1 8 prep- 
scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ACN 
(0.08%TFA)). Yield 1.5 mg. 

Polyether diamine linker synthesis (Second Generation) 

Synthesis is based on chemistry previously described (Kataky, R. et. al. J CHEM 
SOC PERK T 2 (2) 321-327 FEB 1990), with minor modifications and an additional two 
steps. As an example, deviations from the established procedure as well as exact details 
for the additional steps are outlined below for the starting material diethylene glycol. 
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h h Cyan0 " 3 ' 6 ' 9 - ttoxaundecane ( 13 ) T oasti.TedsoluUonof2»/ <w/v)sodium 

roxrte (5 m L ) and diethyiene glycol (5.3g. 50mmol> was added acrylonitriie 7.9Sg 
SOn, mol> The mixture was stirred a. room temperature overnight and additioned wflh 
50 mL .chloromethane. The organio layer was washed 2x with brine and dried ( M gS04) 
The so vent was removed by rotary evaporation. The oily residue was treated with 200 
proof ethano.. and the solvent was removed by rotary eva pora«on. This was repeated 2x 
to remove excess unreacted acrylonitrile. The product was used without further 
purification . . 

Diethyl 4.7,10-trioxatridecane-1.13-dioata (14) 2g (9.4mmol) 13, was dissolved 
m 5 mL ethanol. 1g cone, sufloric acid was added slowly, over 5 minutes. The reaction 
was heated to reflux overnight. The reaction was extracted with 40 mL methylene 
ohloride. washed once with 1 0 mL water and 3x with 10 mL dilute brine solution The 
organic layer was dried (MgS04) and solvent was removed to yield an oil. The final 
product was purified by silica chromatography (methylene chloride/ethyl acetate). 

te.rah„H 1 r ihydrOXy " 4 ' 7,10 " W ° XattdeCane <15) Prepar6d as **=*ed. «*0 

etrahydrofuran as solvent. (1.7g. 5.5mmo, 14, 50 mL distilled [CaH2] THF. 0 66g 

16.5mmol lithium aluminum hydride). Once addition was complete, excess LAH was 

quenched with ethanol, and the salts precipitated by dropwise addition of saturated 

sod,um sulfate solution until a white precipitate formed. The solvent was removed the 

precprtate washed 6x 30 mL wflh THF and the combined organic extracts were ' 

evaporated to yield an oil. Final product is purified by silica chromatography (firs, wflh 

methylene ohloride then with a.hyt acetate and finally with acetone). 

was J' 1 "' 0 ;' 0 ^' 7 - 10 -^™^ (analog using P2, (16) 1.1g. ( 4 .9mmol) 15 
was added to 1,5g (l4.6mmo„ distilled pyridine in 30 mL dry benzene with sfirrtn 
oflowed by 1 ,8g (14.6mmol> .hiony, chloride. The mixture was healed to reflux for 6 
hours. After cooling in an ice bath. 5 mL 3M HCI was added with vigorous stirring The 
organ* layer was separated, washed 3x with a dilute brine soiution, and dried (NaS04) to 
View a ye„ow,sh oil. Afterwashing and removal of solvent, the dichlorida was used 
without further purification. 
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Additional steps 

1,13-dicyano-4,7,10-trioxatridecane) (17) To a stirred solution of 0.78g 
(15.5mmol) sodium cyanide in 4 ml_ dimethyl sulfoxide at 80*C was added 1g 
(3.9mmol) of 16. After 2 hours, the reaction was additioned with 10 mL of saturated 
sodium chloride solution, 5 mL of water, and 50 mL ethyl acetate. The organic layer was 
washed 3x with a brine solution as before, after which the organic layer was dried 
(Na2S04) and the solvents removed. The final product was purified by silica 
chromatography (methlyene chloride/ethyl acetate). ESI-MS: predicted, 240.1; observed, 
241.1 (M + H+)+ 

1 ,15-diamino-5,8,1 1-trioxapentadecane (18) A stirred solution of 50 mL dry 
THF containing 0.42g (1 0.4mmol) fresh LAH was heated to gentle reflux under argon for 
15 minutes. 0.5g (2mmol) 17 in 15 mL dry THF was added dropwise over 20 minutes, 
maintaining a gentle reflux. The unreacted LAH was quenched with ethanol, and the 
mixture was treated with dropwise addition of saturated sodium sulfate under efficient 
stirring until a white precipitate formed. The mixture was filtered, and the precipitate was 
washed 6x 30 mL with THF. The organic extracts were combined and the solvent was 
removed by rotary evaporation to yield an oil. ESI-MS: predicted, 248.1; observed, 249.1 
(M + H+)+ 

Deuteration 

Deuterium has been incorporated into the diamine linker by reduction of 14 and 
17 using lithium aluminum deuteride (98% D) to achieve a d-8 deuterated diamine. No 
other aspects of the synthesis were changed for this procedure. These diols are used in the 
construction of the SFD conjugates as described later. 

Clinical Substrate Synthesis for Sanfilippo Syndrome, type B (N-?-D- 
glucosaminidase deficiency) 

13. p-Aminophenyl-?-D-N-acetylglucosamine (19) 20mg (0.07mmol) p- 
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N,trophenyl-?-D-N-acetylglucx)samine (Sigma) was added to 5mg 
washed palladium catalyst on activated carbon in 3 mL methanol with 
st.rnng in a 5 mL septa-lined vial. The septum was pierced by a 16- 
gauge needle and the vial headspace was flused with H2 gas H2 gas 
was allowed to slowly bubble through the solution for 2 hours after 
wh,ch the catalyst was removed by filtration over diatomaceous earth 
(Cel,te). The solvent was removed by rotary evaporation to yield a 
semi-solid 18mg (90%). 

14. P-Acrylamidophenyl-?-D-N-acetylglucosamine (20) 10mg (0.03mmol) 
19 was added to 15 mL methanol and 100 mcL triethylamine with 
st.rnng. The solution was chilled in an ice bath. 15mg (0.17mmol) 
acry.oyl chloride was dissolved in 2 mL dry methylene chloride and 
added dropwise to the stirred solution over 5 minutes. The reaction 
was allowed to return to room temperature, followed by 2 hours of 
stirring. The solution was then treated with successive anion and 
cation exchange resins (AG MP-1 and AG MP-50, respectively 
BioRad) until neutral pH was obtained with moist pH paper. Solvent 
was removed by rotary evaporation to yield a solid (1 1 mg, 95%) 1 H- 
NMR. Yield 11 mg. 

15. 3,6-dioxa-1,9-nonanedinitrile (21) 2g (0.032mol) ethylene glycol was 
added to 0.5 g dry potassium hydroxide in 30 mL dry benzene 
followed by 5g (0.096mmo.) acrylonitrile with stirring overnight at room 
temperature. The reaction was filtered and the solvent was removed by 
rotary evaporation to yield an oil. Final product was purified by silica 
chromatography (chloroform/methanol) to yield a colorless oil 3 2g 



16. 4,7-dioxa-1 ,10-decanediamine (22) Raney nickel (Aldrich) was washed 
five t.mes with anhydrous methanol by inversion and decantation 
50mg of the washed catalyst was placed in 20 mL anhydrous 
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methanol, followed by 1g (6mmol) 21 in a 50 mL screw-cap vial fitted 
with a Teflon-lined rubber septum. The vial headspace was evacuated 
with H2 gas via an 16-gauge needle piercing the septum. The cap was 
screwed on tightly and the entire assembly was charged to 40psi H2 
and placed in a hot water bath (80;:C) for 4 hours, after which the solid 
catalyst was removed by filtration and the methanol evaporated. The 
final product was purified by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). 

17. Michael addition product of 20 and 22 (23) 5mg (0.015mmol) 20 was 
added to a stirred solution of 13mg (0.06mmol) 22 in 5 mL 0.2M 
sodium carbonate, pH 10.5 at 37?C. The reaction was allowed to 
proceed for 3 days, after which the solution was neutralized with dilute 
trifluoroacetic acid and purified by reverse-phase HPLC (Vydac C-18 
prep-scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). Yield 6 mg. 

18. SFB substrate conjugate of 4 and 23 (24) A4mg (0.013mmol) quantity 
of 4 was dissolved in 1 .5 mL anhydrous DMF with stirring, under argon 
atmosphere. 10 mcL dry triethylamine was added, followed by 4mg 

(0.01 5mmol) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5, UV) by briefly drying the spotted TLC 
plate with a stream of air. After 25 minutes, the mixture was added to 
6mg (0.012mmol) 23 in 1 mL anhydrous DMF. After 2 hours, the 
solvent was removed by vacuum centrifugation and the final product 
was purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 
6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 
4.2 mg. 

19. 1,9-tetradeutero-3,6-dioxa-1,9-nonanedinitrile (25) 0.5g (8mmol) 
ethylene glycol was added to 0.1 g dry potassium hydroxide in 20 mL 
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acetonitrile. followed by 1.4g (24mmol)d-3 acrylonitrile with stirring 
overnight at room temperature. The reaction was filtered and the 
solvent was removed by rotary evaporation to yield an oil. Final 
produot was purified by silica chromatography (chloreform/methano.) to 
yield a colortess oil 0.9g (65%). 

20. 1 ,9-tetradeutero.3,6-dioxa-1 ,9-nonanediamine (26) Raney nickel 
(Aldnch) was washed five times with anhydrous methanol by inversion 
and deoantation. 20mg of the washed catalyst was placed in 30 mL 
anhydrous methanol, followed by 0.5g (3mmol) 25 in a 50 mL screw- 
cap vial fitted with a Teflon-lined rubber septum. The vial headspace 
was evacuated with H2 gas via an 18-gauge needle piercing the 
septum. The cap was screwed on tightly and the entire assembly was 
charged to 40psi H2 and placed in a hot water bath (80 C) for 4 hours 
after which the solid catalyst was removed by filtration and the 
methanol evaporated. The final product was purified by reverse-phase 
HPLC (Vydac C18 prep-scale column, 6 mL/min. Mobile phase- H20 
(0.08%TFAy ACN (0.08%TFA)). 

21. Deuterated analog of 23 (27) 20mg (0.07mmol) p-acrylamidopheny, 2- 
D-galactoside was added to a stirred solution of 90mg (0.4mmol, 26 in 

5 mL 0.2M sodium carbonate, pH 10.5 at 37?C. The reaction was 
allowed to proceed for 3 days, after which the solution was neutralized 
with dilute trifluoioacetic acid and purified by reverse-phase HPLC 
(Vydac C-18 prep-scale column. 6 mL/min. Mobile phase- H20 
(0.08%TFA)/ ACN (0.08%TFA)). Yield 2 mg. 

22. Deuterated analog of 24 (28) A 2mg (6.3mcmol) quantity of 4 was 
dissolved in 1.5 mL anhydrous DMF with stirring, under araon 
atmosphere. 5 mcL triethylamine was added, followed by 2 1mg 
(7.6mcmol) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H. Rf 0.5. UV) by briefly drying the spotted TLC 
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plate with a stream of air. After 35 minutes, the mixture was added to 
4mg (7mcmol) 27 in 1 mL anhydrous DMF. After 2 hours, the solvent 
was removed by vacuum centrifugation and the final product was 
purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 
mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 1.2 
mg. 

23. SFB internal standard conjugate (29) 1.2mg 28 was added to 2 mL 
100mM Tris/10mM MgCI2, pH 7.3 buffer with stirring. 15 units 
recombinant ?-D-galactosidase (Sigma) was added, and after 12 hours 
the mixure was purified by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ACN 
(0.08%TFA)). Yield 0.7 mg. 

Clinical Substrate Synthesis for Sanfilippo Syndrome, type D 
(a sulfatase deficiency) 

24. p-Acrylamidophenyl-?-D-N-acetylglucosamine-6-sulfate (30) 100mg 
(0.28mmol) 20 was added to 10 mL dry DMF under argon atmosphere 
with stirring at room temperature. 89mg (0.56mmol) sulfur trioxide- 
pyridine complex was dissolved in 2 mL dry DMF and was added to 
the reaction in 0.7x, 1.1x, 1.3x and 1.9x equivalents (+700 mcL, +400 
mcL, +200 mcL, and +600 mcL). The reaction progress was monitored 
by 1 H-NMR shift of the anomeric (C1 ) proton chemical shift from 5.29 
to 5.24 ppm by removal of 15 mcL of solution 1 hour after addition of 
each amount of sulfating reagent. The removed mixture was dried by 
vacuum centrifugation and redissolved in d-6 DMSO and analyzed. 
Upon the appearance of more than two forms (starting material and C- 
6 sulfate) of the C1 anomeric proton, the reaction was removed to - 
20?C and stored. The product was purified by vacuum centrifugation to 
remove solvent, followed by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
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(0.08%TFA)). Yield 72%. 

25. Michael addition product of 18 and 30 (31) 25mg (0.058mmol)30 was 
added to a stirred solution of 83mg (0.35mmol) 18 in 5 mL 0 2M 
sodium carbonate, P H 10.5 at 37?C. The reaction was allowed to 
proceed for 3 days, after which the solution was neutralized with dilute 
tnfluoroacetic acid and purified by reverse-phase HPLC (Vydac C-18 
prep-scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). Yield 10 mg. 

26. SFD substrate conjugate of 4 and 31 (32) A 5.7mg (0.018mmol) 
quantity of 4 was dissolved in 1 .0 mL anhydrous DMF with stirring 
under argon atmosphere. 20 mcL dry triethylamine was added 
followed by 5.5mg (0.020mmol) 1 . The formation of active ester was 
momtored by silica TLC (5:1 CHCI3/CH30H, Rf 0.5, UV) by briefly 
drying the spotted TLC plate with a stream of air. After 25 minutes the 
mixture was added to 10mg (0.015mmol) 31 in 1 mL anhydrous DMF 
After 2 hours, the solvent was removed by vacuum centrifugation and 
the final product was purified by reverse-phase HPLC (Vydac C-18 
prep-scale column, 6 mL/min. Mobile phase: H20 (0.08%TFAV ACN 
(0.08%TFA)). Yield 5.4 mg. 

27. 1.2.14,15-octadeutero-1 > 15-diamino-5,8,11-trioxa P entadecane (33) As 
referenced in oPo.yether Diamine Linker Synthesis, Second Generation 

28. Deuterated analog of 31 (34) 25mg (0.07mmol) 20 was added to a 
st.rred solution of 100mg (0.4mmol) 1 1 in 5 mL 0.2M sodium 
carbonate, pH 10.5 at 37?C. The reaction was allowed to proceed for 3 
days, after which the solution was neutralized with dilute trifluoroacetic 
acd and purified by reverse-phase HPLC (Vydac C-18 prep-scale 
column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). Yield 7 mg. 

29. SFD internal standard conjugate (35) A 4mg (12.6mcmol) quantity of 4 
was dissolved in 1 mL anhydrous DMF with stirring, under argon 
atmosphere. 20 mcL triethylamine was added, followed by 4mg 
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(14mcmol) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5, UV) by briefly drying the spotted TLC 
plate with a stream of air. After 20 minutes, the mixture was added to 
7mg (1 1 mcmol) 34 in 1 mL anhydrous DMF. After 4 hours, the solvent 
was removed by vacuum centrifugation and the final product was 
purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 
mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 2. 7 
mg. 

N-(d-Biotinyl-sarcosinyl)-12-aminododecanoic acid (36). Compound 4 (32.2 mg, 0.102 
mmole) was dried overnight in vacuo (with P205). Dry DMF (2 mL) was added and the 
mixture was stirred with warming to affect dissolution under nitrogen. Triethylamine (34 
mcL) was added followed by 1 (20.4 mcL, 0.115 mmole) added in two 10.2 mcL 
portions, 5 min apart. The mixture was stired for 1 hr at room temperature under 
nitrogen. 12-Aminododecanic acid (24.1 mg, 0.112 mmole, Sigma) was added in one 
portion, and the mixture was stirred at room temperature for 2 hr under nitrogen. CHCI3 
(80 mL) was added, and the organic solution was washed with two 10 mL portions of 1 
M HCI. CHCI3 was removed by rotary evaporation, and residual DMF was removed by 
vacuum centrifugation. The compound was dissolved in methanol and purified by HPLC 
(Vydac 218TP prep column). Solvent program is: 0-10 mih, water with 0.06% TFA; 10- 
55 min, 0-100% methanol with 0.06% TFA, flow rate is 6 mL/min. Yield 31 .7 mg. 1H- 
NMR. ESI-MS, calculated 51 3.4, observed 51 3.4 (M + H)+ 

N-hydroxysuccinimidyl ester of 36 (37). Compound 36 (9.8 mg, 19 mcmole) is dissolved 
in 100 mcL of dry DMF under nitrogen. N-hydroxysuccinimide (2.2 mg, 19 mcmole) 
was added followed by dicyclohexylcarbodiimide (3.9 mg, 19 mcmole). The mixture 
was stirred at room temperature for 60 h in the dark. Solvent was removed by vacuum 
centrifugation, and the residue was submitted to flash chromatography on silica gel using 
a gradient of CHCI3/CH30H (15/1) to CHCI3/CH30H (12/1). Yield 9.8 mg. 1H-NMR. 
ESI-MS, calculated 610.8, observed 609.7 (M + H)+ 

N-(N-(d-Biotinyl-sarcosinyl)-12-aminododecanoyl)-pyschosine (38). Compound 37 (6.2 
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mg 10 momole) and pyschosine (4.7 mg, 10 m cmole. Sigma) were dissolved in 200 
mcL of dry DMF under nitrogen. Diisoproyiethylamine (5 mcL) was added, and the 
mature was stirred under nitmgen for 2 days in the dark. The compound was injected 
d,rec iy onto the HPLC ootumn (Vydac 218TP semi-prep,, and the cotumn was 
developed at 2 mUmin with 0-20 min, water with 0.06% TFA, then 20-80 min, 0-100% 
•Th^'^^' ^ ^ ^ ^"^^ ES1-MS, calculated 057.3, observed 

Sph.ngosylphosphorylcholine (4.0 mg. Sigma) was mixed with 1 mL dry DMF and 

ZZ 7 T" ^ VaCUUm Cen,ri,U9a " 0n - TWS W3S ~ *» ™» «-s. The 
a i dned res,due weighed 2.5 mg (5.4 mcmoie). To this residue was added 3.3 mg of 

37(5.4 mcmo, e) ,150 moLofdryD M F.and2.5 mcLofdlisoproytethyiamine. The 
m«ture was stirred under nfcogen in the dark for 3 days. The compound was injected 
d.redly onto the HPLC column (Vydac 218TP semi-prep). and the column was 
developed at 2 mUmin with 0-20 min. water with 0.06% TFA, then 20-80 min, 0-100% 

958 7 <M C 0 '° 6% TFA ' Yfe ' d " "* 1H " NMR - ES '- MS ' 96 °* 



Conjugate of d . biotln wjth , , 13 ^iamino^,7,10- M oxamdecane (40). Compound 2 was 
reacted wrf 1.13-diamino-4.7.10.,noxa«hdecane (Fluka) essenfielly as descnoed forthe 
£hes,s of 3. The product was pun.ed hy HPLC (Vydao 218TP. semi-p.p) using 0- 
100% methanol with 0.06% TFA over 30 min at 1 .5 mL/min. 

todoacetylated 40 (41 ). Compound 40 was treated with 5 equivalents of iodoacetic 
a hydnde (Aldrich, in dry DMF with stimng under nkrogen for 4 h a, noom temperature 
The product was punfed on HPLC as for 40. The structure was confirmed by ESI-MS. 

Octodeu,era«ed41 ,42). The title compound was prepared as for the 40 using 9 instead 
of I.IS-diamincMJ.lO-trioxatridecane. 
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Octadeuterated 42 (43). The title compound was prepared from 42 as for 41 . The 
structure was confirmed by ESI-MS. 
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Ex mplary MS N Techniques and Instrumentati n 

An automated LC-MS/MS system for the identification of proteins by their amino acid 
sequence has been developed. A schematic representation is shown in Fig. 7 The system 
wh,ch consists of an autosample, a capillary HPLC system connected on-line to an ESI triple 
quadmpole MS/MS instrument and a data system is operated in the following way: Proteins 
(typrcally separated by 1 D or 2D ge, electrophoresis) are cleaved wfth a specific protease 
usually trypsin, the resulting cleavage fragments are placed in an autosampler. Eve™ 37 
mmutes the autosampler injects one sample into the HPLC system and the peptides are 
separated by capillar reverse-phase chromatography. As separated peptides eiute from the 
chromatography column, they are ionized by the ESI process, enter me MS and the mass to 
charge ratio <«*) is measured. Any peptide ion whose intensity exceeds a predetermined 
>ntens,ty threshold is automatically selected by the instrument end collided In the collision ceil 
wrth ,nert gas. These collisions result in peptide fragmentation, primarily at the bonds of the 
pept.de backbone (collision induced dissociation, CID). The masses of the CID fragments 
are measured and recorded in the date system. The CID spectrum of a peptide contains 
sufficent information to identify the protein by searching sequence databases with the 
uninterpreted MS/MS spectra. This is accomplished with the Sequent program The 
program identifies each peptide in a sequence database whicn has the same mass as the 
peptide that was selected in the MS for CID and predicts the MS/MS specuom for each one 
of the rsobaric peptides. By matching the experimentally determined CID spectrum with 
computer generated theoretical CD spectra, the protein from which the observed peptide 
ongmated is identified. The system is capable of analyzing protein samples in a fully 
automated fashion at a pace o, lass than 40 min. per sample. Since each peptide represents 
an ndependent protein identification and usually multiple peptides are derived from one 
protein, protein identification by this method is redundant and tolerant to proteins co- 
migrating in a gel. The system is well suited for the detection and characterization of 
modified residues within polypeptide chains. The LC-MS/MS technique and automated 
analysis of the generated CD spectra can be used for.he methods of this invention. 

above, except that peptide separafion and ionization are performed a, signfficantly higher 
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sensitivity. Fig.8 shows a schematic representation of the key design elements. The design 
of the system and its mode of operation have been published. Peptides derived from protein 
digests are concentrated by SPE, separated by CE and analyzed by ESI-MS/MS. The 
resulting uninterpreted CID spectra are used to search sequence databases with the 
Sequest software system. The SPE extraction device is a small reversed-phase 
chromatography column of the dimensions 0.18 x 1 mm which is directly packed in a fused 
silica separation capillary. Peptides contained in a sample solution are adsorbed and 
concentrated on the SPE device, eluted in an estimated 100 - 300 nl of organic solvent and 
further concentrated by electrophoretic stacking and/or isotachophoresis to an estimated 
volume of 5-30 nl. The peptides are then separated by CE in a 20 urn or 50 urn i.d. capillary 
and directly ionized by ESI as they leave the capillary (see reference 13 for design of the 
microspray ionization source). With this system, peptide masses can be determined at a 
sensitivity of 660 attomoles (approx. 500 fg for a 20 residue peptide) at a concentration limit 
of 33 amol/ul and that proteins can be identified by the CID spectra of automatically selected 
peptides at less than 10 fmol (0.5 ng for a protein of 50 kDa) of sample at a concentration 
limit of less than 300 amol/ul. this technique is used for the analysis at very high sensitivity 
of the peptide samples generated by the experiments. It has also been demonstrated that 
the analysis time available for automated CID experiments can be significantly extended by 
data-dependent modulation of the CE voltage. If several peptide ions are detected 
coincidentally in the MS, the CE voltage is automatically dropped. This results in a reduction 
of the electroosmotic flow out of the capillary and therefore in an extension of the time period 
available for selecting peptide ions for CID. The net effect of this peak parking technique is 
an extension of the dynamic range of the technique because the increased time available is 
used for CID of ions with a low ion current Once all the peptide ions are analyzed, 
electrophoresis is automatically reaccelerated by increasing the CE voltage to the original 
value. 
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, T ^t E 1 " Relative - redundant quantitation of a-lactalbumin abundance (after mmno 



Peptid 
e# 


m/z 
(light) 


Charg 
e state 


Peptide Sequence 


r\auo 
(heavy:light) 


1 


518.4 


2+ 


(K) IWCK 


2.70 


2 


568.4 


2+ 


(K)ALCSEK 


2.68 


3 


570.4 


2+ 


(K) CEVFR 


2.90 


4 


760.5 


2+ 


(K) LDQWLCEK 


2.82 


5 


710.1 


3+ 


(K) FLDDDLTDDIMCVK 


2.88 


6 


954.2 


3+ 


(K) DDQNPHSSNICNISCDK 


2.90 


7 

3 Isotnne 


1286. 
9 

ratir* u/nc 


4+ 

i nnf or»ol 


(K) GYGGVSLPEWVCTTFHTSGYDT 
QAIVQNNDSTEYGLFQINNK 


NA a 



highly overlapping due to differences^ only i arrTu bei^en hS^^Sd^ 



59 



WO 00/11208 



PCTAJS99/19415 . 



TABLE 2. Sequence identification and quantitation of the components of a protein 
mixture in a single analysis. 



Gene Name* 


Peptide sequence identified 


Observed 

ratio 
(d0/d8)* 


MeantSD 


Expected 

ratio 
(d0/d8) = 


% 
error 


LCA_BOVIN 


ALC#SEK 
C#EVFR 

FLDDLTDDIMC#VK 


0.94 
1.03 
0.92 


0.96±0.06 


1.00 


4.2 


OVAL_CHICK 


ADHPFLFC#IK 
YPILPEYLQC#VK 


1.88 
1.96 


1.9210.06 


2.00 


4.0 


BGAL_ECOLI 


LTAAC#FDR 

IGLNC#QLAQVAER 

1 1 FDGVNS AFHLWC#NGR 


1.00 
0.91 
1.04 


0.9810.07 


1.00 


2.0 


LACB_BOVIN 


WENGEC#AQK 
LSFNPTQLEEQC#HI 


3.64 
3.45 


3.5510.13 


4.00 


11.3 


G3P_RABIT 


VPTPNVSWDLTC#R 
IVSNASC#TTNC#LAPLAK 


0.54 
0.57 


0.5610.02 


0.50 


12.0 


PHS2_RABIT 


IC#GGWQMEEADDWLR 
TC#AYTN HTVLPEALER 
WLVLC#NPGLAEIIAER 


0.32 
0.35 
0.30 


0.3210.03 


0.33 


3.1 



* Gene names are according to Swiss Prot nomenclature (www.expasv.ch). 

* Ratios were calculated for each peptide as shown in Fig. 3. 

c Expected ratios were calculated from the known amounts of proteins present in each mixture. 

* ICAT-labeled cysteinyl residue. 
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TABLE 3. Protein profiles from yeast rowina on q alartngt> 



Gene 
name* 



Peptide sequence identified 



or ethanol as a carbon sonra 



Observed 

ratio f 
(Eth:Gal) a 



Galactose- Glucose- 
repressed 5 repressed 5 




C#TGGIILTASHNPGGPENDMGIK 
LSIC#GEESFGTGSNHVR 
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SOD! 


r* c u i Li c n o a t m /t* ■#* \ / c a n D u c ki D n 
brnlnbroUAl NoOffVoAornriNrri\ 


U.*rO . 


1 
1 


y 


TFF1 


l\OI ti Vwir\3Unl\ 

C#GGIDK 

FVPSKPMC#VEAFSEYPPLGR 


0.81 : 
0.70 : 
0.74: 


1 
1 

; 1 




VMA2 


IPIFSASGLPHNEIAAQIC#R 


0.70: 


; 1 




YHB1 


HYSLC#SASTK 


0.69 


: 1 





* Gene names are according to the Yeast Proteome Database (YPD) (19). 

# Cysteinyl residue is ICAT-labeled. 

f Protein expression ratios were calculated as described in Fig. 3. 

= Carbon source for yeast growth was 2% ethanol (Eth) or 2% galactose (GAL0. 

§ Gene is known to be galactose- or glucose-repressed (19). 

11 Eight other ribosomal proteins were detected at similar gene expression levels. 
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TABLE - 4 


1 Disease ! 


Enzvme 


DvsfuncHnn 




Butyrylchoiinesterase deficiency 


BCHE 


Decreased or absent 
enzvme activity 


Essential fructosuria hepatic 
fructokinase deficiency 


Fructokinase 


Deficient enzyme activity 


Hereditary fructose intolerance 


Fructose 1,6- 
bisphosphate aldolase B 


Deficient enzyme activity 


. Hereditary fructose 1 ,6- 
I diphosphatase deficipnrv 


Fructose 1,6- 
bisphosphatase 


Deficient enzyme activity 


Erythrocyte aldolase deficiency 
with nonspherocvtic hemolvtir 
anemia (aldolase A deficiency) 


Fructose 1 ,6- 
Diphosphate aldolase A 


Deficient enzyme activity 


Glycoqen storaae disease tvno 
la (von Gierke disease) 


Glucose 6-phosphatase 


Absent or deficient 
enzyme activity 


Glycoqen storaae disease* tvno 
lb 


Glucose 6-phosphate 
translocase 


Deficient transport of 
glucose 6-phosphate 
across the membrane of 
endoplasmic reticulum 

|— * w w IIVm 1 \A III 


Glycogen storage disease type 
III 


Amylo-1 , 6-giucosidase 
(debrancher enzyme) 


Absent or deficient 
enzyme activity 


Glycogen storaae diseasp tvnp 
IV (Andersen disease) 


a-1 , 4 glucan-6-a- 
glucosyltransferase 


Deficient enzyme activity 


Glycogen storaae disease tvne 
V (McArdle disease) 


Muscle glycogen 
phosphorylase 


Absent or deficient 
enzyme 


Glycogen storaae disease X- 
linked phosphorylase kinase 
deficiency 


rnospnorylase 6-kinase 


Deficient or absent 
enzyme activity function 


Glycogen storage disease 
autosomal phosphorylase 
kinase deficiency 


Phosphorylase b-kinase 


Deficient enzyme activity 


Glycogen storage disease liver 
phosphorylase deficiency 


Liver phosphorylase 


Deficient enzyme activity 


Glycogen storage disease type 
VII (Tarui disease) 


Muscle 
phosphofructokinase 1 


Deficient enzyme activity 


Liver glycogen synthase 
deficiency 


Liver glycogen synthase 


Unknown 
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TABLE - 4 



Disease 



Enzyme 



Dysfunction 



Phosphoglycerate kinase 
deficiency 


D hosDhoalvcerate kinase 


Deficient enzyme 


Phosphoglycerate mutase 
deficiency 


Phosphoglycerate 
mutase 


Deficient enzyme 


Muscle lactate dehydrogenase 
deficiency 


Muscle-specific subunit of 
lactate dehydrogenase 
(LDH) 


Absence of M subunit of 
LDH. Muscle LDH is a 
tetramer of the heart- 
specific subunit 


Glucose phosphate isomerase 
deficiency 


Glucose phosphate 
isomerase 


Unknown 


Transferase deficiency 
galactosemia 


Galactose 1 -phosphate 
uridyltransferase 


Deficient enzyme activity 


Galactokinase deficiency 
galactosemia 


Galactokinase 


Deficient enzyme activity 


Epimerase deficiency 
galactosemia 


Uridine diphosphate 
galactose-4-epimerase 


Deficient enzyme action 

in blood cells only 
(benign) or, more rarely, 
in all tissues (generalized) 


Phenylketonuria (PKU) due to 
PAH deficiency 


Phenylalanine 
hydroxylase (PAH) 


Deficient or absent PAH 
activity (<1% normal) 


Hyperphenylalaninemia due to 
DHPR-deficiency 


Dihydropteridine 
reductase (DHPR) 


Deficient or absent DHPR 
activity 


Hyperphenylalaninemia due to 
GTP-CH-deficiency 


Guanosine triphosphate 
cyclohydrolase (GTP-CH) 


Deficient enzyme activity 


Hyperphenylalaninemia due to 
6-PTS-deficiency 


6-Pyruvoyl 
tetrahydropterin synthase 
(6-PTS) 


Deficient enzyme activity 


Oculocutaneous tyrosinemia 
(tyrosinemia type II; tyrosine 
amino-transferase deficiency) 


Tyrosine 
aminotransferase 


Decreased activity 


4-Hydroxyphenylpyruvic acid 
dioxygenase (tyrosinemia type 
III) 


4-Hydroxy-phenylpyruvic 
acid dioxygenase 


Decreased activity 


Maleylacetoacetate isomerase 
deficiency (tyrosinemia type lb) 
(tentative) 


Maleylacetonacetate 
isomerase 


Presumably decreased 
enzyme activity 
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TABLE -4 



I _ Disease ~~] 


Enzyme 


Dvsflinrtirm 




H6D3t0r6nal tvrncinomin 

(tyrosinemia type 1: 
fumarylacetoacetate hydrolase 
deficiency) 


Fumarylacetoacetate 
hydroxylase 


Deficient enzyme activity 


Carbamyl phosphate synthetase 
deficiency 


Carbamyl phosphate 
svnthpta^P 1 


Absent or deficient 
enzyme activity 


Ornithine transcarbamylase 
deficiency 


Ornithine 
transcarbamylase 


Absent or reduced 
enzyme activity 


Argininosuccinic acid 
synthetase deficiency 


Argininosuccinic acid 
synthetase 


Deficient enzyme activity 


Argininosuccinase deficiency 


Argininosuccinate lyase 


Deficient enzyme activity 


Arginase deficiency 


Liver arginase 


Deficient enzyme activity 


Familial hyperlysinemia (variant- 
saccharopinuria) 


u-Mminoaaipic 
semialdehyde synthase 


Deficient enzyme activity 


Maple syrup urine disease 
(MSUD) or branched chain 
ketoarirtpmin 


Branched-chain a-keto 
acid dehydrogenase 


Deficient or absent (<2%) 
BCKAD complex activity 

in mitochondria; 
immunologic absence or 
reduced levels of enTvmp 
subunits; impairment of 
E1 subunit assembly 


Cystathionine p-synthase 
deficiency 


Cystathionine P-synthase 


Deficient enzyme activity 


a-Cystathionase deficiency 


cx-Cvstathiona«5p 


Deficient enzyme activity 


Hepatic methionine 
adenosyltransferase deficiencv 


Isoenzyme of methionine 
adenosvltran^fpracn 


Deficient enzyme activity 


Sarcosinemia 


Sarcosine 
dehydrogenase? 


Deficient enzyme activity 


Nonketotic hyperglycinemia 


Glycine cleavage system 


Deficient enzyme activity 


Hyperuracil thyminuria 


Dihydropyrimidine 
dehydrogenase 


Deficient enzyme activity 


Dihydropyrimidinuria 


Dihydropyrimidinase 


Unknown 


Pyridoxine dependency with 
seizures 


Brain glutamic acid 
decarboxylase-1 


Deficient coenzyme 
binding? (brain) 
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Disease Enzyme 


Dysfunction 




GABA aminotransferase 
deficiency 


GABA-oc-ketoglutarate 
transaminase | 


Deficient enzyme activity 


4-Hydroxybutyric aciduria 


Succinic semialdehyde 
dehydrogenase j 


Deficient enzyme activity 


Serum carnosinase deficiency 
and homocarnosinosis 


Serum carnosinase j 


Deficient enzyme 


Alkaptonuria 


Homogentisic acid 
oxidase 


Absent or deficient 
enzyme activity | 


Isovaleric acidemia 


Isovaleryl-CoA j 
oenyarogenase i 


Deficient enzyme activity, 

□ciiuitJiu pruimii, 
abnormal peptide size J 


Isolated 3-methylcrotonyl-CoA 
carboxylase deticiency 


3-Methylcrotonyl-CoA 
caruoxyiase 


Deficient enzyme activity 


3-Methylglutaconic aciduria 
Mild form: 


3-Methylglutaconyl-CoA 
hydratase ] 


Deficient enzyme activity 


3-Hvdroxv-3-methvlalutarvl-CoA 
lyase deficiency 


3-Hydroxy-3- 
methylglutaryl-CoA lyase 


Deficient enzyme activity 


Mevalonic aciduria 


Mevalonate kinase 


Deficient enzyme activity J 


Mitochondrial acetoacetyl-CoA 
thiolase deficiency 


Mitochondrial 
acetoacetyl-CoA thiolase 
(T2) 


j Deficient enzyme activity, 

decreased protein, 
j unstable protein j 


Propionic acidemia (2 nonallelic 
forms designated pcc>4 and 
pccBC) 


Propionyl-CoA 
carboxylase (PCC) 


Deficient enzyme activity 
(nonallelic forms reflect 
mutations in nonidentical 
j subunits of PCC) j 


Methylmalonic acidemia (2 
allelic variants designated mat 0 
and mut 


Methyldmalonyl-CoA 
mutase (MUT) 
apoenzyme 


Absent MUT activity in 
mut° t deficient MUT 
activity due to reduced 

affinity for cofactor 
(adenosylcobalamin) in 
1 muf j 


Glutaric acidemia type 1 


Glutaryl-CoA 
dehydrogenase 


Deficient enzyme activity 


Cytochrome oxidase deficiency 


Cytochrome oxidase 
polypeptides 


Decreased activity of the 
cytochrome oxidase 
1 complex j 
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Pyruvate dehydrogenase 

complex deficiency^ 
decarboxylase compo nent 

Pyruvate dehydrogenease E 2 
transacylase 



Pyruvate decarboxylase, 



Combined a-ketoacid 

dehydrogenase 
deficiency/lipoamide 



Dihydrolipoamide 
transacylase 

Lipoamide 
dehydrogenase 



Pyruvate carboxylase deficipnrv 


Pyruvate carboxylase 


1 Carnitine palmitoyl transferase 1 
(CPT 1) deficiency 


Carnitine palmitoyl 
I transferase 1 


Carnitine/acvlcarnitine 
translocase deficiency 


Carnitine/ acylcarnitine 
translocase 


Carnitine palmitoyl transferase II 

(CPT in HAfioiort^w 

I \ ur • *ij ueiiciency 


Carnitine palmitoyl 
transferase II 


j Very long-chain acyl-CoA 
dehydrogenase (VLCAD) 
deficiency 


Very long-chain acyl-CoA 
dehydrogenase 


Long-chain acyl-CoA 
dehydrogenase (LCAD) 


Long-chain acyl-CoA 
dehydrogenase 


Long-chain L-3-hydroxyacyl- 
CoA dehydrogenase (LCHAD) 
1 deficiency 


L-3-hydroxyacyl-CoA 
dehydrogenase 


"Afunctional enzyme (TFE) I 
deficiency 


Trifunctional enzyme 


Dienolyl-Co reductase j 
deficiency 


2,4-dienoyl-CoA 
reductase 


Medium-chain acyl-CoA ! 
dehydrogenase (MCAD | 
deficiency 


Medium-chain acyl-CoA 
dehydrogenase 


Short-chain acyl-CoA 1 
dehydrogenase (SCAD) 
,. deficiency 


Short-chain acyl-CoA 
dehydrogenase 



Decreased enzyme 
activity, decreased 
protein 



Decreased enzyme 
activity; abnormal protein 
electrophoretic mobility 

Decreased enzyme 
activity 



Absent enzyme activity; 7 
cases absent enzyme, 
protein, and mRNA 



Deficient enzyme 



Deficient translocase 
Deficient enzyme 
Deficient enzyme 

Deficient enzyme 
Deficient enzyme 

Deficient enzyme 
Deficient enzyme 
Deficient enzyme 

Deficient enzyme 
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Disease 



Enzyme 



Dysfunction 



Glutaric acidemia type II 


Electron transfer 
flavoprotein (ETF); 
ETRubiquinone 
oxidoreductase 


In some cases, no 
enzyme antigen; in 
others, no enzyme 
activity 


Glycerol kinase deficiency (Gkd) 


Glycerol kinase 


The microdeletion 
involves not only GK but 
also the other deleted 

IOCI. ArlLr, UMU, VI O, 

and other linked loci 


Primary gout: superactive 
variant of 
phosphoribosylpyrophosphate 
(PP-nbose-P) synthetase 


PP-ribose-P synthetase 


Enhanced enzyme 
activity 


Primary gout: partial deficiency 
of hypoxanthine guanine 
phosphoribosyltransferase 
(HPRT) 


Hypoxanthine guanine 
phosphoribosyl 
transferase (HPRT) 


Absent or deficient 
enzyme activity 


Lesch-Nyhan syndrome: 
deficiency of hypoxanthine 

guanine 
phosphoribosyltransferase 
(HPRT) 


Hypoxanthine guanine 
phosphoribosyltrans- 
terase (nrK l j 


Absent or deficient 
enzyme activity 


2,8-Dihydroxyadenine lithiasis 
(adenine 
phosphoribosyltransferase 
deficiency) 


Adenine 
phosphoribosyltransferas 
e 


lype i. aosent enzyme 
activity; type II: reducted 
affinity for PP-ribose-P 


Adenosine deaminase 
deficiency with severe combined 
immunodeficiency disease 


Adenosine deaminase 


Absent or greatly 
I diminished enzyme 
activity 


Purine nucleoside 
phosphorylase deficiency with 
cellular immunodeficiency 


Purine nucleoside 
phosphorylase 


Aosent or greatly 
diminished enzyme 
activity 


Myoadenylate deaminase 
deficiency 


Myoadenylate deaminase 
(AMPDI) 


No enzyme activity; on 
immunoreactive protein 


Xanthinuria 


Xanthine dehydrogenase 
(xanthine oxidase) 


Type 1: absent xanthine 
dehydrogenase activity; 
type II: absent xanthin 
dehydrogenase and 
aldehyde oxidase activity 
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Hereditary orotic aciduria 


UMP synthase 


Deficient enzyme activity 
(unstable protein) 


Pyrimidine S'-nucleotidase 
deficiency 


Pyrimidine S'- 
nucleotidase 


Absent or un^tahip 
enzyme 


Dihydropyrimidine 
aenyarogenase deficiency 


Dihydropyrimidine 
dehydrogenase 


Absent or un^tahio 
enzyme 


Dihydropryimidase deficiency 


Dihydropyrimidase 


Absent or un<*tahie 
enzyme 1 


Familial lipoprotein lipase 
deficiency 


Lipoprotein lipase 


Nonfunctional nrntoin in 

some, nondectable 
enzyme activity and 
protein in others 


Familial lecithin: chloesterol 
acyltransferase deficiency 


Lecithinrcholesterol 
acyltransferase 


Absent enzyme protein or 
deficient enzyme activity 


5-Aminoievulinic acid 
dehydratase porphyria 


5-Aminolevulinic acid 
dehydratase 


Minimal enzyme activity 


Acute intermittent porphyria 


Porphobilinogen 
deaminase 


Decreased enzyme 
activity (--50%) 


Congenital erythropoietic 
porphyria 


Uroporphyrinogen III 
cosynthase 


Minimal enzyme activity 


Porphyria cutanea tarda (familial 
form) 


Uroporphyrinogen 
decarboxylase 


Decreased enzyme 
activity (-50%) 


Hepatoerythropoietic porphyria 


Uroporphyrinogen 
decarboxylase 


Minimal enzyme activity 


Hereditary coproporphyria 


Coproporphyrinogen 
oxidase 


Decreased enzyme 
activity (-50%) 


Variegate porphyria 


Protoporphyrinogen 
oxidase 


Decreased enzyme 
activity (-50%) 


Erythropoietic protophorphyria 


Ferrochelatase 


Decrease enzyme activity 
(-50%) 


Crigler-Najjar syndrome, type 1 


Bilirubin UDP- 
glucuronosyltransferase 


Absent enzyme activity 


Crigler-Najjar syndrome, type II 


Bilirubin UDP- 
glucuronosyltransferase, 


Markedly reduced 
enzyme activity 



68 



WO 00/11208 



PCT/US99/19415 



TABLE -4 



Disease 



Enzyme 



Dvsfunction 



Gilbert syndrome 


Bilirubin UDP- 
glucuronosyltransferase 
activity 


Reduced enzyme activity 


Refsum disease 


Phytanic acid a- 
hydroxylase 


Deficient enzyme activity 


Primary hyperoxaluria type 1 


Alanine-glyoxylate 
aminotransferase 


Loss of enzyme catalytic 
activity and aberrant 
subcellular distribution 


Primary hyperoxaluria type 2 


Glyoxylate reductase/D- 
gtycerate dehydrogenase 


Loss of enzyme catalytic 
activity 


(3m2 gangliosidosis: 
hexosaminadase a-subunit 
deficiency (variant B, Tay-Sachs 
disease) 


p-hexosaminidase 


Absent or defective 
hexosamininidase A (aP) 
activity 


Glycogen storage disease type 
II 


a-glucosidase 


Absent or deficient 
enzyme activity 


Mucopolysaccharidosis 1 
(Hurler, Scheie, and Hurler- 
Sheie syndromes, MPS, MPS 
IS, MPS IH/S) 


a-L-iduronidase 


Absent enzyme activity 


Mucopolysaccharidosis II 
(Hunter syndrome) 


Iduronate sulfatase 


Absent enzyme activity 


Mucopolysaccharidosis in 
(Saniflippo syndrome) types A, 
B, C and D 


ill A. Heparan /v-sultatase 
IIIB: a-/V-acetyl- 
glucosaminidase 
IIIC: Acetyl-CoA: a- 
glucosaminide 
acetyltransferase 
HID: N-acetyl- 
glucosamine-6-sulfatase 


Absent enzyme activity 


Mucopolysaccharidosis IV 
(Morquio syndrome) types A 
and B 


IVA: Galactose 6- 
sulfatase 
IVB: P-Galactosidase 


Absent enzyme activity 


Mucopolysaccharidosis VI 
(Maroteaux-Lamy syndrome) 


N-acetyl-galactosamine 
4-sulfatase 


Absent enzyme activity 


Mucopolysaccharidosis VII (Sly 
syndrome) 


P-glucuronidase 


Absent enzyme activity 



69 



enncTrn itc eueer /«•■■ 



WO 00/11208 



PCT/US99/19415 



TABLE -4 



Disease 



Enzyme 



Dysfunction 



l-cell disease (ML-II) 


N-acetylglucosaminyl-l- 
phosphotransferase 


Phosphorylation of many 
lysosomal enzymes 


Schindler disease (a-N-ac^M. 
1 galactosaminidase deficiency) 


a-/v-acetyl- 
| galactosaminidase 


Deficient activity of a-A/- 
acetylgalactosaminidase 


1 a-Mannosido^k 

1 B-Mannosidn^i<; 
1 Sialidosis 


a-D-mannosidase 
I p-D-mannosidase 


Deficient or unstable 
enzyme activity 

Deficient enzyme activity 


1 Aspartylgiucosaminuria 
Fucosidosis 


oneuraminidase 
1 r^opai lyyiuuusaminiuase 
I a-L-fucosidase 


Deficient enzyme activity 
Deficient enzyme activity 
Deficient enzyme activity 


Wolman disease and cholesteryl 
ester storage disease 


Acid lipase 


Deficient enzyme activity 


1 Ceramidase deficiency (Farber 
| lipogranulomatosis) 


[ Ceramidase 


Deficient enzyme activity 


Niemann-Pick disease (NPD) 
types A and B (primary 
j sphingomyelin storage) 


Sphingomyelinase 


Deficient 
sphingomyelinase activity 


Gaucher disease type 1 
j (nonneuronopathic) i 


Glucocerebrosidase 


Decreased catalytic 
activity and some 
instability of enzyme 
protein 


Globoid-cell leukodystrophy 
(Krabbe disease) 

Metachromatic leukodystrophy 

Fabry disease j 


Galactosylceramidase 

ArylsulfataseA 
a-Galactosidase A 


Absent enzyme activity 


Deficient enzyme activity 

Nonfunctional or unstable 
enzyme protein 


1 g mi gangliosidosis [ 


Acid P-galactosidase 

(GLBI) 


Deficient enzyme activity 


1 G M2 gangliosidosis: } 
! hexosaminidase a-subunit } 

deficiency (variant B, Tay-Sachs 
1 disease) 


P-hexosaminidase 


Absent or defective 
hexosaminidase A (aP) 
activity 


Steroid 21 -hydroxylase 
L deficiency salt-losing form 


Steroid 21 -hydroxylase 


Absent or truncated 
enzyme with no activity 
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Disease 



Enzyme 



Dysfunction 



Steroid 5a-reductase 2 
deficiency 


Steroid 5a-reductase 2 


Absent or unstable 
enzyme activity 


Steroid sulfatase deficiency (X- 
linked ichthyosis) 


33-hydroxysteroid 
sulfatase 


Absent immunoreactive 
and enzymatically active 
protein (both deletion and 
nondeletion patients) 


Methylenetetrahydrofolate 
reductase deficiency 


Methylenetetrahydro- 
folate reductase 


Absent or deficient 
enzyme activity. 
Thermolabile variants 
have been described. 


Holocarboxylase synthetase 
deficiency 


Holocarboxylase 
synthetase 


Deficient holocarboxylase 
synthetase activity 


Biotinidase deficiency 


Biotinidase 


Deficient biotinidase 
activity 


Hereditary methemoglobinemia 

secondary to cytochrome b 5 
reductase deficiency, types 1, II, 
and III 


Cytochrome b 5 reductase 


Deficient enzyme activity 
in erythrocyte cytosol only 
(type I), in all tissues 
(type II), and in all 
hematopoetic cells (type 
III) 


Pyruvate kinase deficiency 
hemolytic anemia 


Pyruvate kinase 


Deficient enzyme activity 


Hexokinase deficiency 
hemolytic anemia 


Hexokinase 


Deficient enzyme activity 


Glucosephosphate isomerase 
deficiency hemolytic anemia 


Glucosephosphate 
isomerase 


Deficient enzyme activity 


Aldolase deficiency hemolytic 
anemia 


Aldolase (A type) 


Deficient enzyme activity 


Triosephosphate isomerase 
deficiency hemolytic anemia 


Triosephosphate 
isomerase 


Enzyme activity deficient 
in all tissues 


Phosphoglycerate kinase 
deficiency hemolytic anemia 


Phosphoglycerate kinase 


Deficient enzyme activity 
in hemizygotes 


2,3-Diphospho-glyceromutase 
and phosphatase deficiency 


2,3-Diphospho-giycerate- 
mutase and phosphatase 
(1 protein) 


Deficient enzyme activity 


6-Phosphogluconate 
dehydrogenase deficiency 


6-Phosphogluconate 
dehydrogenase 


Enzyme activity 
deficiency • 
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L Disease 


Enzvme 






Glutathione peroxidase 
deficiency 


Glutathione peroxidase 


Diminished enzyme 
activity 


Glutathione reductase 
deficiency 


Glutathione reductase 


Deficient enzyme activity 


Glutathione synthetase 
deficiency hemolytic anemia 


fall itoiklMMA ll_ J. 

Glutathione synthetase 


Deficient enzyme activity 


Y-Glutamylcysteine svntheta<?P 
deficiency hemolytic anemia 


Y-Glutamylcysteine 
synthetase 


Deficient enzyme activity 


Adenosine deaminase 
hyperactivity hemolytic anemia 


Adenosine deaminase 


Overproduction of 
structurally normal 
enzyme protein mediated 
at mRNA translation level 


Pyrimidine nucleotidase 
deficiency hemolytic anemia 

J »wi » «WI J UU Ul Iwl 1 lid 


Pyrimidine nucleotidase 


Deficient enzyme activity 


Myeloperoxidase deficiency 


Myeloperoxidase 


Absent or dpfiripnt 
enzyme activity 


Carbonic anhydrase II 
deficiency syndrome 
(osteopetrosis with renal tubular 
acidosis) 


Carbonic anhydrase II 


Quantitative deficiency of 
carbonic anhydrase II 


Albinism, oculocutaneous 
tyrosinase-negative type 
(OCAIA) 


Tyrosinase 


Absent, reduced, or 
unusual enzyme activity 


Canavan disease 


Aspartoacylase 


Deficient enzyme activity 



72 



WO 00/1 1208 PCT/US99/19415 



TABLE 5. Molecular masses of protonated and sodiated substrate-conjugates, products, and internal 
standards for CDGS enzymes. 

m/z 

Enzyme Substrate Product Internal 

standard 

(M + H)* (M + Na) + (M + H) + (M + Na) + (M + H) + (M + Na) + 

Typela.b 711 733 549 571 555 577 

Manose- 725 747 563 585 570 592 

transferase 

Type II 1156 1178 1343 1365 1348 1370 

Type 1126 1U8 2362 a 2384 a 2367 a 2389 a 

a Calculated for the GlcNAc«T II product and internal standard containin a GlcNAc-GlcNAc-Mannose- 
(Mannose-GicNAc) 2 residue. 
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Scheme 1 




LINKER (heavy has 8 deuteriums) 



FUNCTIONAL GROUP 



FI r*t£ f °V { the Pr ° CedUre f0r q^titative protein profile 
measurements based on stable isotope ratios by mass spectrometry 



Sample protein mixture 



Reference Protein mixture 



Derivatize cysteines 
with light isotope reagent 



Derivatize cysteines 
with heavy isotope reagent 



Enzymatically cleave protein 



Affinity isolation of derivatized peptides 

~~l 



Analysis and quantitation of derivatized peptides by LC-MS/MS 
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Scheme 3 




HN^NH 



o o 



H 




(HI) 
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Scheme 4 



. Linker/ | Enzyme 

( ^andl§ )^/v\ Substrate 1 




Linker/ j 

( Handjg Xwv( Substrate 1 



Enzyme Linker 

► (Jandj^vwu 



Scheme 5 

Structures of Substrate Conjugates 1 and 2, Product Conjugates 3 and 4, and Internal 
Standards 5 and 6 
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Scheme 7 





(CH 2 )i2CH 3 



HO 

Galactosyl- 
Cerebroside 



Glucose 



ATP 



Scheme 8 

Glucose-6-P Fructose-6-P 

PMlb 

ATP 

Mannose Mannose-6-P 



PMM2 



GTP 



Dol-P-mannose ) GDP-mannose « — Mannose- 1-P 



GDP Dol-P 



Dol = Dolichol 
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Scheme 9 




R = SiMe 2 t-Bu 



1. (CF 3 COO) 2 Pd 
OR benzoqumone, *" 

AcOH 

2. CH 3 ONa/CH 3 OH QH 





6 X X 



B-S-Dol 10 -P 



1. Active ester coupling 

2. OH-deprotection 

3. Phosphorylation 



OPO, 



Alkyl = CH 3 
CD 3 
C 2 H 5 
C 2 D 5 
X = HorD 



Scheme 10 
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Scheme 1 1 
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T 

HN NH 




BF 3 , Et 2 0 
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Scheme 12 (continued) 



OAC 




Scheme 13 

o 




NH 2 -Try-Gln-Ser-Asn-Ser-Thr-Met-NH 9 

I 

B-S-NH-Tiy-Gln-Ser-Asn-Ser-Thr-Met-NH 2 
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Scheme 15 
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Scheme 17 




15, x = OH 

16, X = CI 

17, X = CN 

18, X=CH 2 NH 2 
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Scheme 18 



NHAC 



19, X = NH 2 20, X=NH 




23, X = 



24, X = 





x 




(X 
D 




D v D 




*CH a 



21, X=CN 

22, X = CH 2 NH 2 

25, X = CN 

26, X = CH 2 NH 2 
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velocities of one or more enzymes in a sample are determined by quantitation of 
affinity tagged enzyme products and which further comprises the step of introducing 
into a sample a known amount of one or more internal standards for each of the 
affinity tagged enzyme products of an enzyme, the velocity of which, is to be 
quantitated. 

7. The method of claim 1 wherein the released affinity tagged components are 
separated by chromatography prior to detecting and identifying the components by 
mass spectrometry. 

8. The method of claim 1 in which a plurality of proteins or protein functions in one 
sample are detected and identified. 

9 The method of claim 1 further comprising a step in which one or more of the proteins 
in a sample are chemically or enzymatically processed to expose a functional group 
that can react with an affinity tag. 

10. The method of claim 9 wherein the PRG is an enzyme substrate for one or more 
enzymes, the deficiencies of which, are linked to a disease state. 

1 1 The method of claim 9 wherein an affinity tagged, substantially chemically identical 
and differentially isotopicatly labeled enzyme substrate is provided for each enzyme 
that is to be detected and identified in a sample. 

12. The method of claim 1 wherein PRG is a protein reactive group that selectively reacts 
with certain protein functional groups and a plurality of proteins are detected and 
identified in a single sample. 

13. The method of claim 1 1 wherein two or more affinity tagged, substantially chemically 
identical and differentially isotopically labeled protein reactive reagents having 
different specificities for reaction with proteins are provided and reacted with each 
sample to be analyzed. 

14. The method of claim 1 3 wherein .all of the proteins in a sample are detected and 
identified. 

1 5. The method of claim 1 wherein the relative amounts of one or more proteins in two or 
more different samples are determined and which further comprises the steps of 
combining the differentially labeled samples, capturing affinity tagged components 
from the combined samples and measuring the relative abundances of the affinity 
tagged differentially labeled proteins or the affinity tagged differentially labeled 
peptides originating from that protein. 

16. The method of claim 1 5 which determines the relative amounts of membrane proteins 
in one or more different samples. 

17. The method of claim 1 5 in which different samples contain proteins originating from 
different organelles or different subcellular fractions. 
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21 . The reagent of claim 20 wherein PRG is a sulfhydryl reactive group or an amine 
reactive group. 

22. The reagent of claim 20 wherein PRG is an enzyme substrate. 

23. The reagent of claim 20 wherein the A-L-PRG is soluble in a sample liquid to be 
analyzed. 

24. The reagent of claim 20 wherein the linker is a cleavable linker. 

25. The reagent of claim 20 which has the general formula: 

A-B1-X1-(CH2)n-[X2-(CH2)m ]x-X3-(CH2)p -X4-B2-PRG 
where: A is an affinity label; 
PRG is a protein reactive group; and 

B1-X1-(CH2)n-[X2-(CH2)m ]x-X3-(CH2)p -X4-B2 is a linker group wherein: 

X1 , X2, X3 and X4, independently of one another, and X2 independently of other X2 , 
can be selected from 0 ( S, NH, NR f NRR'+, CO, COO, COS, S-S. SO, S02, CO-NR'. 
CS-NR\ Si-O, aryl or diaryl groups or X1-X4 may be absent; 
B1 and B2, independently of one another, are optional groups selected from COO, 
CO, CO-NR 1 , CS-NR\ ( CH2 )q-CONR\ (CH2)q-CS-NR\ or (CH2)q; 

n, m, p, q and x are whole numbers that can take values from 0 to about 100, where 
the sum of n+xm+p+q is less than about 100; 

R is an alkyl, alkenyl, alkynyl, alkoxy or an aryl group that is optionally substituted with 
one or more alkyl, alkenyl, alkynyl, or alkoxy groups; and 

R f is a hydrogen, an alkyl, alkenyl, alkynyl, alkoxy or an aryl group that is optionally 
substituted with one or more alkyl, alkenyl, alkynyl, or alkoxy groups 

wherein one or more of the CH2 groups in the linker can be optionally substituted with 
alkyl, alkenyl, alkoxy groups, an aryl group that is optionally substituted with one or 
more alkyl, alkenyl, alkynyl, or alkoxy groups, an acidic group, a basic group or a 
group carrying a permanent positive or negative charge; wherein one or more single 
bonds linking non-adjacent CH2 groups in the linker can be replaced with a double or 
a triple bond and wherein one or more of the atoms in the linker can be substituted 
with a stable isotope. 

26. The reagent of claim 20 wherein the affinity label is biotin or a modified biotin. 

27. The reagent of claim 20 wherein the affinity label is selected from the group 
consisting of a 1 ,2-diol, glutathione, maltose, a nitrilotriacetic acid group, or an 
oligohistidine. 
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28. The reagent of claim 20 wherein the affinity label is a hapten. 

29. The reagent of claim 20 wherein PRG is a sulfhydryl-reactive group. 

30. The reagent of claim 20 wherein PRG is an iodoacetylamide group, an epoxide, an a- 
haloacyl group, a nitnles, a sulfonated alkyl, an aryl thiols or a maleimide . 

31 . The reagent of claim 20 wherein PRG is an amine reactive group, a group that 
reacts with a homoserine lactone or a group that reacts with carboxylic acid group. 

32. The reagent of claim 20 wherein PRG is selected from the groups consistinq of a 
amine reactive pentafluorophenyl ester group, an amine reactive N-hydroxy 
succmimide ester group, sulfonyl halide. isocyanate. isothiocyanante. active ester 
tertafluorophenyl ester, an acid halide, and an acid anyhydride; a homoserine lactone 
reactive pnmary amine group, and an carboxylic acid reactive amine, alcohols or 
2,3,5,6-tetrafluorophenyl trifluoroacetate. 

33. The reagent of claim 20 wherein PRG is a substrate for an enzyme. 

34. The reagent of claim 20 wherein PRG is a substrate for an enzyme the deficiency of 
which is associated with a birth defect. 

35. The reagent of claim 20 wherein PRG is a substrate for an enzyme the deficiency 
of which is associated with a lysosomal storage disease. 

37. The reagent of claim 20 wherein PRG is a substrate for p-galactosidase 
acetyl-a-D-glucosaminidase, heparan sulfamidase, acetyl-CoA-a- D-glu'cosaminide 
N-acetyltransferase or N-acetylglucosamine-6-sulfatase. 

38. The reagent of claim 20 wherein at least one of B1 or B2 is CO-NR' or CS-NR. 

39 " ISLT*??"* 0f daim 20 wherein X1 and X4 are sele cted from NH, NR, and 
NRR +, X3 is O and all X2 groups are O. 

40. The reagent of claim 20 wherein the linker contains a disulfide group. 

41 . The reagent of claim 20 wherein any atom of the linker may be substituted with a 
heavy isotope. 

42. A reagent kit for the analysis of proteins by mass spectral analysis that comprises a 
reagent of claim 20. 

43. The reagent kit of claim 42 that comprises one or more reagents of claim 20. 

44. The reagent kit of claim 42 further comprising one or more proteolytic enzymes for 
use in digestion of affinity tagged proteins. 

45. The reagent kit of claim 42 which comprises a set of substantially chemically 
identical differentially labelled affinity tagged reagents. 
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46. The reagent kit of claim 42 wherein the reagent is an affinity tagged enzyme 
substrate reagent. 

47. The reagent kit of claim 46 which comprises a set of substantially chemically 
identical differentially labeled affinity tagged enzyme substrates. 

48. The reagent kit of claim 47 further comprising a set of substantially chemically 
identical differentially labeled affinity tagged enzyme products. 
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Tandem mass spectrum of a cysteine-modified peptide from a-lactalbumin. 
Modification of the cysteine residue with the custom synthesized biotinylating reagent 
did not affect the ability of the Sequest computer program to correctly match this 
peptide to a database sequence. 
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